Product of novel fructose polymers in embryos of transgenic plants

ABSTRACT

This invention includes a recombinant DNA molecule comprising a sequence for a fructosyltransferase capable of producing a novel fructan, a method for producing transgenic plants exhibiting a novel fructan, transformed plants and plant parts comprising said novel fructan, and products prepared therefrom.

[0001] This application claims the benefit of U.S. Provisional Application No. 60/404,844 filed Aug. 21, 2002. The entire content of the provisional application is herein incorporated by reference for all purposes.

FIELD OF INVENTION

[0002] This invention relates to the field of plant molecular biology. The present invention includes methods for producing transgenic plant species showing a novel fructan profile, transformed plants or plant parts showing said novel fructan profile, and products prepared therefrom.

BACKGROUND OF INVENTION

[0003] The major reserve carbohydrates found in vascular plants are sucrose, starch, cellulose and fructan. Sucrose is most commonly purified from sucrose-producing plants and used as a sweetener. Starch and cellulose are currently used in numerous food and non-food applications in their native form, but their usefulness is greatly expanded by enzymatic or chemical modification. Fructan has commercial applications in the industrial, medical, food and feed industries.

[0004] Fructan includes linear or branched polymers of repeating fructose residues connected by β2-1 and/or β2-6 fructosyl-fructose linkages, optionally including one terminal glucosyl unit. The number of residues contained in an individual fructan polymer is also known as the degree of polymerization, or DP, and varies greatly depending on the source from which the fructan is isolated. For example, fructan isolated from fungal species, such as Aspergillus syndowi, may contain two or three fructose residues, fructan obtained from plants have low to intermediate DP (3 to 200), and fructan found in bacteria, such as Bacillus amyloliquefaciens or Streptococcus mutans, have a DP of 5,000 or greater.

[0005] Fructan accumulation in plants is highly sensitive to environmental changes. Exposure to drought or frost dramatically alters the quality of the fructan accumulated (Praznik and Beck (1987) Agr. Biol. Chem. 51:1593-1599). Traditional breeding programs could, in theory, result in varieties with reduced quality losses due to environmental changes. However, programs of this type are very time consuming, are not in place at this time, and would likely be implemented only when the fructan industry proves them to be viable.

[0006] The ability to produce fructan of the desired size in large amounts in crops of agronomic importance, such as corn and/or soybean, will reduce fructan production costs. Fructan production in corn for example, allows the utilization of the corn byproducts (oil, meal and gluten feed) in addition to removing the costs of converting glucose to fructose. Hydrolysis of fructan into individual fructose residues results in a product consisting of at least 99% fructose. This highly pure product provides an alternative to the inefficient isomerization step, usually used to convert glucose isolated from starch to fructose, and eliminates the need for fructose enrichment by ion exchange chromatography. Crystallization of fructose is simplified by starting with material that consists of 99%(+) fructose. Availability of fructose at a competitive cost would allow it, easily dehydrated to 5-hydroxymethyl-furfural (HMF), to be utilized as a building block for pharmaceuticals, such as ranitidine and Zantac®. HMF may also be used as starting material for polymers such as Kevlar®, and Nomex®, in addition to the potential for use in opto-electronic devices, due to the special optical effects of the furan nucleus (Schiweck et al. (1992) in Carbohydrates as Organic Raw Materials, Lichtenthaler ed., VCH Press, NY, pp. 72-82). HMF may be converted into carbocyclic and heterocyclic compounds, creating a role in almost every part of applied chemistry, if only its purity could be combined with increased production and reduced cost.

[0007] The fructan produced in plants differ structurally depending on the linkages of the fructosyl residues. Linear β2-1 linkages of fructose residues form inulin(s) found in chicory, sunflower, and Jerusalem artichoke, among others. Linear β2-6 linkages of fructose residues form levan(s) found in some grasses. Mixed levans, also called graminans, have a mixture of β2-1 and β2-6 linkages and are found for example in wheat and barley. Fructose residues connected by β2-1 and β2-6 linkages on carbons 1 and 6 of the glucose moiety of the sucrose molecule form the inulin neoseries. These fructan are found in onion, leek, and asparagus, for example. The levan neoseries contain predominantly β2-6 linkages of fructosyl residues on either end of the glucosyl moiety of the sucrose molecule and are found in oat, for example.

[0008] Several models have been proposed for the formation of the different plant fructan. In one of these models (Vijn, I. and Smeekens, S. (1999) Plant Phys. 120:351-359) conversion of sucrose to 1-kestose (also called isoketose) is catalyzed by sucrose:sucrose 1-fructosyltransferase (1-SST; EC 2.4.1.99) and conversion to 6-kestose is catalyzed by sucrose:fructan 6-fructosyltransferase (6-SFT; EC 2.4.1.10). Elongation of 6-kestose to levans is also catalyzed by 6-SFT. Addition of a fructosyl unit from sucrose to 1-kestose produces neokestose when catalyzed by fructan:fructan 6G-fructosyltransferase (6G-FFT) and produces bifurcose when catalyzed by 6-SFT. Conversion of neokestose to the levan neoseries and of bifurcose to mixed-type levans is also catalyzed by 6-SFT. This last conversion has been suggested to be by the action of exohydrolase or fructan:fructan 1-fructosyltransferase (1-FFT, EC 2.4.1.100). 1-FFT catalyzes the production of inulin, inulin neoseries, and mixed type levans.

[0009] In contrast to fructan-producing plants, which use at least two fructosyltransferases (FTFs) to produce fructan of low to intermediate DP (DP3 to 200), bacteria produce high-DP fructan (DP>5000) using a single FTF. Differences in the specificity for donor and acceptor molecules have also been reported for bacterial and plant FTFs. The bacterial enzymes are known to have a hydrolase activity by which water is used as a fructosyl acceptor resulting in the release of significant amounts of fructose from sucrose. This hydrolase activity is referred to as invertase activity. Hydrolytic activities have been suggested for some SSTs (Koops and Jonker (1996) Plant Physiol. 110:1167-1175) but have yet to be found in FFTs (Chambert, R. and Petit-Glatron, M. (1993) in Inulin and Inulin Containing Crops, A. Fuchs ed. Elsevier Press, Amsterdam. pp. 259-266).

[0010] Fructose, liberated from sucrose by invertase activity, cannot be used to increase the length of a fructan polymer. Bacterial FTFs, therefore, convert sucrose to fructan less efficiently than do the plant enzymes. Plant and bacterial FTFs also differ in their affinity for sucrose, the sole substrate. Jerusalem artichoke SST has a Km for sucrose reported to be approximately 100 mM (Koops, A. and Jonker, H., (1994) J. Exp. Bot. 45:1623-1631). By contrast, the bacterial enzyme has a much lower Km of approximately 20 mM (Chambert, R. and Petit-Glatron, M. (1991) Biochem. J. 279:35-41). This difference may have a critical effect on fructan synthesis, resulting in higher or lower levels of accumulation, depending on the concentration of sucrose in the cell.

[0011] In an attempt to produce fructan in crops of agronomic importance, transgenic plants expressing bacterial FTFs have been produced. Oligosaccharides and fructan are produced in differing amounts when the FTF is expressed under control of a constitutive promoter, an endosperm specific promoter, or a tuber specific promoter and is directed to different subcellular locations. Direction to the vacuole, the chloroplast, and the endosperm proved to be the most efficient in producing levan in transgenic plants. In PCT publication No. WO 89/12386, published Dec. 28, 1989, FTF activity was found in transgenic tomato plants prepared expressing B subtilis levansucrase under the control of the mas promoter and having an apoplast-signal sequence, or expressing the L. mesenteroides dextransucrase under the control of the constitutive cauliflower mosaic virus 35S (CaMV 35S) promoter. While potato and tobacco plants expressing E. amylovora levansucrase under the control of the CaMV 35S promoter did not produce detectable fructan when the enzyme was expressed in the cytoplasm, direction of the enzyme to the apoplasm or the tubers produced detectable levans (PCT publication No. WO 94/04692, published Mar. 3, 1994). PCT publication No. WO 94/14970, published Jul. 7, 1994, shows that expression in tobacco or potato of B. subtilis SacB or S. mutans ftf under the control of the CaMV 35S promoter resulted in the production of bacterial-like fructan regardless of the subcellular location of the enzyme (vacuole, apoplast, or cytoplasm). Production of fructan by means of the S. mutans ftf gene in tobacco resulted in only very low amounts of DP3. Smeekens, J. C. et al. (PCT publication No. WO 96/01904 published Jan. 25, 1996).

[0012] The results using B. amyloliquefaciens SacB in transgenic tobacco, potato, or corn plants are not consistent as set forth in PCT publication No. WO 95/13389, published May 18, 1995. In tobacco, expression of the bacterial FTF under the control of the SSU promoter and directed to the cytoplasm proved to be detrimental to the plants, expression under the control of the inducible 2-1.3 promoter produced small amounts of fructan when directed to the chloroplast and produced small amounts of enzyme when directed to the vacuole. In the same publication it was shown that when the bacterial FTF is expressed under the control of the tuber-specific patatin promoter and directed to the cytoplasm no detectable levels of fructan are produced, yet detectable levels of fructan are produced when the enzyme is directed to the vacuole. Furthermore, the same publication shows that expression in corn of the bacterial FTF under the control of the endosperm-specific 10 kD zein promoter produced fructan when directed to the cytoplasm or the vacuole of both, dent maize and starch mutant corn lines.

[0013] After inhibition of starch production, due to expression of the antisense ADP-glucose pyrophosphorylase gene, targeted expression of the Erwinia amylovora levansucrase to the apoplasm, vacuole, or cytosol of potato yields varied results as set forth in PCT publication No. WO 94/04692 published Mar. 3, 1994. WO 94/04692 shows fructan accumulating to an appreciable level only in plants where the transgene was targeted to the apoplasm or the vacuole. Expression of the levansucrase in the apoplasm resulted in smaller tubers while its expression in the vacuole did not change the tuber morphology. The fructan produced in the transgenic plants had similar characteristics to the fructan naturally produced by the bacteria. Targeting of SacB gene of B. subtilis to the plastid resulted in fructan accumulation over 10% dry weight in tobacco and up to 5% dry weight in potatoes grown during winter. The fructan produced in these transgenic plants are believed to be associated with the starch granules (PCT publication No. WO 97/29186 published Aug. 14, 1997). Transgenic potato expressing an FTF from S. mutans under the patatin promoter has been reported to produce inulin, useful for industrial applications (PCT publication No. WO 97/42331 published Nov. 13, 1997).

[0014] Transformation of plants with DNA sequences encoding plant FTFs have also been reported. Production of fructan using plant derived FTFs in transgenic dicots has been successful to a limited extent in tobacco, petunia and potato. Leaves of transgenic petunia plants, expressing Jerusalem artichoke 1-SST, and leaves of transgenic potato plants expressing the Jerusalem artichoke 1-SST and 1-FFT, produced only small amounts of tri- tetra-, and penta-saccharides. The tri- and tetra-saccharides are detectable by thin layer chromatography while the penta-saccharides are detectable only by HPAEC analyses (PCT publication No. WO 96/21023 published Jul. 11, 1996). Transgenic petunia plants expressing Jerusalem artichoke 1-SST and 1-FFT produced fructan molecules of DP up to 25 (van der Meer, I. M., et al. (1998) Plant J. 15:489-500). Smeekens, J. C. et al. (PCT publication No. WO 96/01904 published Jan. 25, 1996) suggest preparing transgenic plants with sequences encoding onion SST, barley 6-SFT, Jerusalem artichoke FFT, potato FFT, or mutants of these genes to produce fructan for use as sweeteners. Sequences encoding onion SST, barley 6-SFT, and potato FFT were identified, the cDNAs encoding the FTFs were isolated and used for the preparation of transformation vectors which were then used to create transgenic plants. No data was reported with regard to the amount of the fructan made.

[0015] Transgenic potato expressing the artichoke (Cynare scolymus) SST under the direction of the patatin B33 promoter or the CaMV 35S promoter resulted in transgenic plants producing limited amounts of the DP-3,1-kestose (PCT publication No. WO 98/39460 published Sep. 11, 1998). Transgenic potato plants expressing globe artichoke (Cynara scolymus) 1-SST and 1-FFT produced inulin molecules of DP>60, similar to the inulin profile found in the globe artichoke. Total fructan accumulation in the tubers averaged around 35 μmol/g fresh weight artichoke (Hellwege, E. M., et al. (2000) Proc. Natl. Acad. Sci. U.S.A. 97:8699-8704; PCT publication No. WO 99/24593 published May 20, 1999).

[0016] Production of fructan using plant derived FTFs in transgenic monocots has been successfully accomplished in maize. Targeting the fructan synthesizing enzymes 1-SST and 1-FFT of Jerusalem artichoke to the vacuole of maize endosperm (PCT publication No WO 99/46395 published Sep. 16, 1999) resulted in production of low amounts of inulin-type fructan.

[0017] As can be seen above efforts to express fructan in plant species in which they are not ordinarily produced at high levels show varying levels of success, as measured by the level of fructan obtained. Aside from the particular fructosyltransferase used, there are three major variables related to the expression system: the plant species, the organ (seed, tuber, leaf, throughout the plant) and intracellular location (vacuole, plastid, cytoplasm, apoplast) in which or to which the fructosyltransferases are expressed or targeted. This variation in observed fructan level presumably relates in large part to the biological and physiological complexities of the different systems used. For example, seeds are a natural storage organ of plants and thus may initially seem a likely place to attempt to accumulate fructan. Seeds contain high levels of proteins, carbohydrates (usually starch), and lipids that are stored for use by the young plant upon germination. It is these storage products that give seeds, when used as grain, their economic value. But seeds are not uniform across species. The relative levels of the different types of storage products varies; see for example Table 19.2, page 1029 of Biochemistry and Molecular Biology of Plants (B. Buchanen, W. Gruissem, and R. Jones, American Society of Plant Physiologists, 2000; hereafter BMBP). Corn has very high levels of carbohydrates, a pattern that approximately holds for the major cereal crops. Soybean, a dicot, has little soluble carbohydrate and is high in protein, with moderate amounts of oil. The soluble carbohydrate pool in soybean seeds is comprised mainly of raffinose family oligsaccharides (39%) and sucrose (54%). Starch accounts for less than 1% of the mature seed dry weight (dry wt), and hexose sugars are barely detectable (Yazdi-Samadi B, et al. (1977), Agr. J. 69:481-486). Potato tubers, another storage organ, store mainly starch. This difference is perhaps reflective of physiological differences. Potato tuber initiation, growth, and development are characterized by significant modifications in hexose and sucrose concentrations and in the ratios of hexose:sucrose and glucose:fructose, but during the main period of reserve product accumulation the hexose levels are in general higher than in soybean. In contrast to both these examples, other dicots such as rapeseed have higher oil than protein levels in their seed. While not intending to be bound by any theory or theories of operation, these differences in storage reserve composition and underlying physiology may be related to differential expression of large numbers of genes at different times in development, as shown by Ruuska, S. A., et al. ((2002) Plant Cell 14:1191-1206) in the model system Arabidopsis. Plants do not compensate the turning off, by mutation or transgenic means, of the pathway for one reserve component by producing higher levels of another component. That fact, and the differences in storage compounds between species, make the introduction of genes encoding enzymes in the pathway for a different storage reserve product not usually found in a species, for example fructan, unpredictable between different species. It is not guaranteed that appropriate metabolic precursors for the desired product will be available.

[0018] Seeds have the added complexity that they are genetically non-uniform as they result from a unique double fertilization event. When the pollen tube reaches the ovule, two sperm cells are released. One fertilizes the egg cell, giving rise to the zygote. The second sperm cell fertilizes a unique structure called the central cell, which is diploid. The resulting triploid fertilization product gives rise to the triploid endosperm. (BMBP page 1022). This endosperm undergoes different fates in different plant species. In most dicots the endosperm serves a transient role and is much reduced or even essentially gone in the mature seed. By contrast, in many monocots, in particular the cereal crops, the endosperm is the main storage organ of the seed and, by weight, forms the larger part of the seed. Completely different genes are expressed in the endosperm and embryo in cereals such as corn. For example, while neither seed portion is rich in proteins, the endosperm seed proteins are of the prolaminin class (zeins), while the embryo contains globulins. Promoters of these two classes of proteins thus can be used to direct expression in one part of the seed or the other. In fact protein is a minor component; the endosperm is rich in starch; the embryo is rich in oil, however, the embryo is so small relative to the endosperm that corn grain is overall much richer in starch. One wishing to express a novel storage product in corn seeds thus must decide whether to express the relevant genes in the embryo or endosperm. As the endosperm comprises the larger part of the seed, endosperm specific promoters have often been chosen, but the differences in physiology between the two parts of the seed mean that this is not necessarily always the best choice. The present invention demonstrates that surprisingly high levels, on a seed basis, of fructan is obtained by expression of fructosyltransferases in the embryo instead of the endosperm of corn seeds.

[0019] In dicot species, where the endosperm is much reduced or degraded completely, expression of exogenous genes is generally done in the embryo. The embryo develops from the zygote. The developing embryo soon itself develops different tissues and organs (described in BMBP, pp 1024 et seq.). The embryo axis contains the root and shoot meristems that will eventually form the new plant. The cotyledons (called the scutellum in monocots) serve as the storage organs of the embryo, serving the role fulfilled by the endosperm in the cereals. The present invention also demonstrates that fructan is obtained by expression of fructosyltransferases in soybean embryos.

SUMMARY OF INVENTION

[0020] The present invention includes a plant and plant part comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to at least a portion of at least one coding sequence for a plant fructosyltransferase, operably linked to a vacuole targeting sequence, said molecule sufficient to express a protein capable of producing fructan having a degree of polymerization of at least three, in an embryo of the plant, or any progeny thereof, wherein the progeny comprise said molecule.

[0021] The present invention also includes a recombinant DNA molecule comprising an embryo specific promoter operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, operably linked to a vacuole targeting sequence, the molecule sufficient to express a protein capable of producing fructan in an embryo cell.

[0022] Another embodiment of the present invention is a method of producing fructan in a plant comprising constructing at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, transforming a plant with said construct, regenerating the plant to produce seed, harvesting seed from the plant, and extracting fructan from the seed.

[0023] Yet another embodiment of the present invention is a method of screening transgenic maize tissue expressing embryo targeted genes comprising preparing Type-II maize callus for transformation, transforming callus, selecting transgenic callus lines, regenerating transgenic somatic embryos, and propagating transgenic somatic embryos for both plant production and early trait analyses.

[0024] The present invention also includes a foodstuff comprising fructan obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, the molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of the plant, or any progeny thereof, wherein the progeny comprise the molecule.

[0025] Yet another embodiment of the present invention is a foodstuff comprising an inulin obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, the molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of the plant, or any progeny thereof, wherein the progeny comprise the molecule.

[0026] The present invention also includes an industrial product comprising fructan obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, the molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of the plant, or any progeny thereof, wherein the progeny comprise the molecule.

[0027] Yet another embodiment is grain from the plant of the present invention. Foodstuffs produced by the grain of the plant of the present invention is another embodiment.

BRIEF DESCRIPTION OF FIGURES AND SEQUENCE LISTING

[0028] The invention can be more fully understood from the following detailed description and the accompanying Figures and Sequence Listing which form part of this application.

[0029]FIG. 1 depicts a diagram of the GLOBSST01(f) cassette used to express the Jerusalem artichoke SST in transgenic maize embryos. The cassette contains an embryo-specific globulin promoter, a polynucleotide fragment comprising an entire Jerusalem artichoke SST coding region (including the native secretory and vacuolar targeting signals), and a nos 3′ transcription termination region.

[0030]FIG. 2 depicts a diagram of the GLOBFFT10(f) cassette used to express the Jerusalem artichoke FFT in transgenic maize embryos. The cassette contains an embryo-specific globulin promoter, a polynucleotide fragment comprising an entire Jerusalem artichoke FFT coding region (including the native secretory and vacuolar targeting signals), and the nos 3′ transcription termination region.

[0031]FIG. 3 shows the carbohydrate profile resulting from HPAE/PAD analysis of maize somatic embryos not containing cassettes expressing embryo-specific Jerusalem artichoke SST or FFT. nC=nanoCoulombs.

[0032]FIG. 4 shows the carbohydrate profiles resulting from HPAE/PAD analysis of transgenic maize somatic embryos containing intact copies of the GLOBSST01(f) cassette. S=sucrose, Rf=raffinose, DP3=1-kestose and DP4=1-kestotetraose (inulin-type fructose polymers), DP=degree of polymerization. nC=nanoCoulombs.

[0033]FIG. 5 shows the carbohydrate profile resulting from HPAE/PAD analysis of transgenic maize somatic embryos containing intact copies of the GLOBSST01(f) and GLOBFFT01(f) cassettes. S=sucrose, Rf=raffinose, DP3 through DP7=inulin polymers containing 1 or more fructose residues. nC=nanoCoulombs.

[0034]FIG. 6 shows the carbohydrate profile resulting from HPAE/PAD analysis of individual maize kernels not containing cassettes expressing embryo-specific Jerusalem artichoke SST or FFT. nC=nanoCoulombs.

[0035]FIG. 7 shows the carbohydrate profile resulting from HPAE/PAD analysis of individual kernels from transgenic maize lines containing intact copies of the GLOBSST01(f) cassette. S=sucrose, Rf=raffinose, DP3 through DP7=inulin polymers containing 1 or more fructose residues. nC=nanoCoulombs.

[0036]FIG. 8 shows the carbohydrate profile resulting from HPAE/PAD analysis of individual kernels from transgenic maize lines containing intact copies of the GLOBSST01(f) and the GLOBFFT01(f) cassettes. S=sucrose, Rf=raffinose, DP3 through DP10=inulin polymers containing 1 or more fructose residues nC=nanoCoulombs.

[0037]FIG. 9 depicts a diagram of vector pJMS02 used to express the guayule SST in transgenic soybean embryos. The vector comprises two expression cassettes. One cassette contains a KTi 3 promoter driving the expression of the entire guayule SST coding region (including the native secretory and vacuolar targeting signals) followed by a KTi 3′ transcription terminator. The other cassette contains the T7 promoter driving the expression of HPT, followed by the E. coli T7 RNA polymerase transcription termination signal.

[0038]FIG. 10 depicts a diagram of vector pRM03 used to express the guayule 1-FFT in transgenic soybean embryos. The vector comprises two expression cassettes. One expression cassette contains an embryo-specific KTi 3 promoter directing the expression of an entire guayule 1-FFT (including the native secretory and vacuolar targeting signals) followed by a KTi 3′ transcription termination region. The other cassette contains the T7 RNA polymerase promoter directing the expression of HPT followed by a T7 transcription termination signal.

[0039]FIG. 11 depicts a diagram of vector pJMS01 used to express the guayule 1-FFT in transgenic soybean embryos. The vector comprises three expression cassettes. One cassette contains an embryo-specific β-conglycinin promoter directing the expression of an entire guayule 1-FFT coding region (including the native secretory and vacuolar targeting signals) followed by a phaseolin 3′ transcription termination region. Another cassette contains the bacterial T7 RNA polymerase promoter directing the expression of HPT followed by the T7 transcription terminator region. The third cassette contains the CaMV 35S promoter directing the expression of HPT followed by the nos 3′ transcription terminator.

[0040]FIG. 12 depicts a diagram of vector pRM02 used to express the guayule SST in transgenic soybean embryos. The vector comprises three expression cassettes. One cassette contains an embryo-specific β-conglycinin promoter driving the expression of an entire guayule SST coding region (including the native secretory and vacuolar targeting signals) followed by a phaseolin 3′ transcription termination region. The two other cassettes contain polynucleotide fragments encoding HPT, one under the control of the bacterial T7 RNA polymerase promoter and transcription terminator regions, and one under the CaMV 35S promoter and nos 3′ transcription terminator.

[0041]FIG. 13 depicts a diagram of vector pRM01 used to express guayule 1-SST and 1-FFT in transgenic soybean embryos. The vector comprises four expression cassettes. One cassette contains an embryo-specific KTi 3 promoter driving expression of an entire guayule SST coding region followed by a KTi 3′ transcription termination region. Another cassette contains an embryo specific β-conglycinin promoter driving expression of an entire guayule 1-FFT coding region followed by a phaseolin 3′ transcription termination region. The other two cassettes contain polynucleotide fragments encoding HPT, one under the control of the bacterial T7 RNA polymerase promoter and transcription terminator regions, and one under the CaMV 35S promoter and nos 3′ transcription terminator.

[0042]FIG. 14 depicts a diagram of vector pRM04 used to express the guayule 1-SST and 1-FFT in transgenic soybean embryos. The vector comprises four expression cassettes. One cassette contains an embryo specific β-conglycinin promoter directing expression of an entire guayule SST coding region followed by the phaseolin 3′ transcription termination region. Another cassette contains the embryo-specific KTi 3 promoter driving expression of an entire guayule 1-FFT coding region followed by a KTi 3′ transcription termination region. The other two cassettes contain polynucleotide fragments encoding HPT, one under the control of the bacterial T7 RNA polymerase promoter and transcription terminator regions, and one under the CaMV 35S promoter and nos 3′ transcription terminator.

[0043]FIG. 15 shows the carbohydrate profile resulting from HPAE/PAD analysis of transgenic soybean somatic embryos transformed with expression vectors pRM02 and pRM03 containing the guayule 1-SST and 1-FFT coding sequences. S=sucrose, Rf=raffinose, St=stachyose. DP3 through DP5=inulin polymers containing 1 or more fructose residues. nC=nanoCoulombs.

[0044]FIG. 16 shows the carbohydrate profile resulting from HPAE/PAD analysis of soybean somatic embryos not transformed with cassettes expressing guayule SST or FFT coding sequences. nC=nanoCoulombs.

[0045]FIG. 17 shows the carbohydrate profile resulting from HPAE/PAD analysis of dried-down soybean somatic embryos transformed with expression vector pRM01 containing nucleotide sequences encoding guayule 1-SST and 1-FFT. nC=nanoCoulombs.

[0046]FIG. 18 shows the carbohydrate profile resulting from HPAE/PAD analysis of individual soybean mature seeds transformed with expression vector pRM01 containing nucleotide sequences encoding guayule 1-SST and 1-FFT. nC=nanoCoulombs.

[0047]FIG. 19 shows the carbohydrate profile of soybean seeds not containing nucleotide sequences encoding guayule 1-SST and 1-FFT. nC=nanoCoulombs.

[0048] The following sequence descriptions and the Sequence Listing attached hereto comply with the rules governing nucleotide and/or amino acid sequence disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825.

[0049] SEQ ID NO:1 is the polynucleotide sequence of plasmid vector GLOBSST01 comprising the GLOBSST01(f) cassette used to express Jerusalem artichoke SST in transgenic maize embryos. The cassette contains an embryo-specific globulin promoter directing the expression of an entire SST coding region (including the native secretory and vacuolar targeting signals) followed by a nos 3′ transcription termination region.

[0050] SEQ ID NO:2 is the polynucleotide sequence of plasmid vector pGLOBFFT01 comprising the GLOBFFT01(f) cassette used to express Jerusalem artichoke FFT in transgenic maize embryos. The cassette contains an embryo-specific globulin promoter directing the expression of an entire FFT coding region (including the native secretory and vacuolar targeting signals) followed by the nos 3′ transcription termination region.

[0051] SEQ ID NO:3 is the nucleotide sequence of pDETRIC, a polynucleotide fragment containing the bar gene under the control of the CaMV 35S promoter and OCS 3′-end and used to co-transform maize together with pGLOBFFT01(f) and/or pGLOBSST01(f).

[0052] SEQ ID NO:4 is the nucleotide sequence of oligonucleotide primer SST-1 used for detection of the Jerusalem artichoke SST in transformed tissue.

[0053] SEQ ID NO:5 is the nucleotide sequence of oligonucleotide primer SST-2 used for detection of the Jerusalem artichoke SST in transformed tissue.

[0054] SEQ ID NO:6 is the nucleotide sequence of oligonucleotide primer FFT-1 used for detection of the Jerusalem artichoke FFT in transformed tissue.

[0055] SEQ ID NO:7 is the nucleotide sequence of oligonucleotide primer FFT-2 used for detection of the Jerusalem artichoke FFT in transformed tissue.

[0056] SEQ ID NO:8 is the nucleotide sequence of the oligonucleotide primer SST-3 used for the PCR amplification of the polynucleotide fragment encoding guayule SST from clone epb3c.pk007.n11.

[0057] SEQ ID NO:9 is the nucleotide sequence of the oligonucleotide primer SST-4 used for the PCR amplification of the polynucleotide fragment encoding guayule 1-SST from clone epb3c.pk007.n11.

[0058] SEQ ID NO:10 is the nucleotide sequence corresponding to the entire cDNA insert in clone epb3c.pk007.n11 encoding an entire guayule 1-SST including secretory and vacuolar signals.

[0059] SEQ ID NO:11 is the nucleotide sequence of the oligonulcleotide primer FFT-3 used for the PCR amplification of the polynucleotide fragment encoding Guayle 1-FFT from clone epb3c.pk007.j9.

[0060] SEQ ID NO:12 is the nucleotide sequence of the oligonucleotide primer FFT-4 used for the PCR amplification of the polynucleotide fragment encoding guayule 1-FFT from clone epb3c.pk007.j9.

[0061] SEQ ID NO:13 is the nucleotide sequence corresponding to the entire cDNA insert in clone epblc.pk007.j9 encoding an entire guayule FFT including secretory and vacuolar signals.

[0062] SEQ ID NO:14 is the nucleotide sequence of vector pJMS02 comprising a cassette expressing the guayule SST under control of the embryo-specific KTi 3 promoter and transcription termination regions and a cassette comprising a fragment encoding HPT under control of the E. coli T7 promoter and terminator region.

[0063] SEQ ID NO:15 is the nucleotide sequence of vector pRM03 comprising the embryo-specific KTi 3 promoter directing the expression of an entire guayule FFT (including the native secretory and vacuolar targeting signals) followed by a KTi 3′ transcription terminator and the E. coli T7 RNA polymerase promoter directing the expression of HPT followed by a T7 transcription terminator.

[0064] SEQ ID NO:16 is the nucleotide sequence of the linker fragment used to introduce sites into the modified plasmid pKS17. In a 5′ to 3′ orientation, this linker fragment contains restriction sites for Asc I, Hind III, Bam HI, Sal I, Asc I.

[0065] SEQ ID NO:17 is the nucleotide sequence of vector pJMS01 comprising three expression cassettes. One cassette contains the embryo-specific β-conglycinin promoter operably linked to a polynucleotide fragment encoding an entire guayule FFT coding region (including the native secretory and vacuolar targeting signals) followed by a phaseolin 3′ transcription terminator. Another cassette contains the E. coli T7 RNA polymerase promoter operably linked to a polynucleotide encoding HPT, which is operably linked to the E. coli T7 transcription terminator. A third cassette contains the CaMV 35S promoter operably linked to a polynucleotide encoding HPT, which is operably linked to the nos 3′ transcription terminator.

[0066] SEQ ID NO:18 is the nucleotide sequence of vector pRM02 comprising three expression cassettes. One cassette contains the embryo-specific β-conglycinin promoter operably linked to the polynucleotide encoding an entire guayule SST coding region (including the native secretory and vacuolar targeting signals) followed by a phaseolin 3′ transcription terminator. Another cassette contains the E. coli T7 RNA polymerase promoter operably linked to a polynucleotide encoding HPT, followed by the E. coli T7 transcription terminator. A third cassette contains the CaMV 35S promoter operably linked to a polynucleotide encoding HPT, which is operably linked to the nos 3′ transcription terminator.

[0067] SEQ ID NO:19 is the nucleotide sequence of vector pRM01 comprising four expression cassettes. One cassette expresses guayule SST under control of the embryo-specific KTi 3 promoter and transcription terminator. Another cassette expresses guayule FFT under control of the embryo-specific β-conglycinin promoter and phaseolin transcription terminator. The other two cassettes express HPT, one under control of the bacterial T7 RNA promoter and one under control of the CaMV 35S promoter.

[0068] SEQ ID NO:20 is the nucleotide sequence of vector pRM04 comprising four expression cassettes. One cassette expresses guayule FFT under control of the embryo-specific KTi 3 promoter and transcription terminator. Another cassette expresses guayule SST under control of the embryo-specific β-conglycinin promoter and phaseolin transcription terminator. The other two cassettes express HPT, one under control of the bacterial T7 RNA promoter and one under control of the CaMV 35S promoter.

[0069] The Sequence Listing contains the one letter code for nucleotide sequence characters and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB standards described in Nucleic Acids Res. 13:3021-3030 (1985) and in the Biochemical J. 219 (No. 2):345-373 (1984) which are herein incorporated by reference. The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. §1.822.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE INVENTION

[0070] Definitions

[0071] In the context of this disclosure, a number of terms should be utilized.

[0072] The terms “polynucleotide”, “polynucleotide sequence”, “nucleic acid sequence”, and “nucleic acid fragment”/“isolated nucleic acid fragment” are used interchangeably herein. These terms encompass nucleotide sequences and the like. A polynucleotide may be a polymer of RNA or DNA that is single- or double-stranded, that optionally contains synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA, synthetic DNA, or mixtures thereof.

[0073] The term “isolated” refers to materials, such as nucleic acid molecules and/or proteins, substantially free or otherwise removed from components that normally accompany or interact with the materials in a naturally occurring environment. Isolated polynucleotides may be purified from other nucleic acid sequences, such as and not limited to chromosomal and extrachromosomal DNA and RNA, in a host cell in which they naturally occur, for example. Conventional nucleic acid purification methods known to skilled artisans may be used to obtain isolated polynucleotides. The term also embraces recombinant polynucleotides and chemically synthesized polynucleotides.

[0074] The term “recombinant” means, for example, that a nucleic acid sequence is made by an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated nucleic acids by genetic engineering techniques. A “recombinant DNA molecule or construct” comprises an isolated polynucleotide operably linked to at least one regulatory sequence. The term also embraces an isolated polynucleotide comprising a region encoding all or part of a functional RNA and at least one of the naturally occurring regulatory sequences directing expression in the source (e.g., organism) from which the polynucleotide was isolated, such as, but not limited to, an isolated polynucleotide comprising a nucleotide sequence encoding a herbicide resistant target gene and the corresponding promoter and 3′ end sequences directing expression in the source from which sequences were isolated.

[0075] “Gene” refers to a nucleic acid fragment that expresses a specific protein, including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. “Native gene” refers to a gene as found in nature with its own regulatory sequences. “Chimeric gene” refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. “Endogenous gene” refers to a native gene in its natural location in the genome of an organism. A “foreign” gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes. A “transgene” is a gene or recombinant DNA construct that has been introduced into the genome by a transformation procedure.

[0076] “Coding sequence” refers to a DNA sequence that codes for a specific amino acid sequence. “Regulatory sequences” refer to nucleotide sequences located upstream (5′ non-coding sequences), within, or downstream (3′ non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing, stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.

[0077] A polynucleotide sequence encoding a “portion” of a gene or coding sequence is a polynucleotide sequence encoding at least 10 amino acids and capable of producing an active fructosyltransferase in an embryo cell.

[0078] “Promoter” refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal elements, the latter elements often referred to as enhancers. Accordingly, an “enhancer” is a DNA sequence which can stimulate promoter activity and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. Promoters which cause a gene to be expressed in most cell types at most times are commonly referred to as “constitutive promoters”. New promoters of various types useful in plant cells are constantly being discovered; numerous examples may be found in the compilation by Okamuro and Goldberg, (1989) Biochemistry of Plants 15:1-82. It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity.

[0079] As used herein, “substantially similar” refers to polynucleotides, genes, coding sequences, and the like, wherein changes in one or more nucleotide bases results in substitution of one or more amino acids, but do not affect the functional properties of the polypeptide encoded by the nucleotide sequence. “Substantially similar” also refers to polynucleotides wherein changes in one or more nucleotide bases does not affect the ability of the polynucleotide to mediate alteration of gene expression by gene silencing through for example antisense or co-suppression technology. “Substantially similar” also refers to modifications of the polynucleotide of the instant invention such as deletion or insertion of one or more nucleotides that do not substantially affect the functional properties of the resulting transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional properties of the resulting protein molecule. It is therefore understood that the invention encompasses more than the specific exemplary nucleotide or amino acid sequences and includes functional equivalents thereof. The terms “substantially similar” and “corresponding substantially” are used interchangeably herein.

[0080] Substantially similar polynucleotides may be selected by screening polynucleotides representing subfragments or modifications of the polynucleotides of the instant invention, wherein one or more nucleotides are substituted, deleted and/or inserted, for their ability to affect the level of the polypeptide encoded by the unmodified polynucleotides in a plant or plant cell. For example, a substantially similar polynucleotides representing at least one of 30 contiguous nucleotides derived from the instant polynucleotides can be constructed and introduced into a plant or plant cell. The level of the polypeptide encoded by the unmodified polynucleotides present in a plant or plant cell exposed to substantially similar polynucleotide can then be compared to the level of the polypeptide in a plant or plant cell that is not exposed to the substantially similar polynucleotides.

[0081] An “intron” is an intervening sequence in a gene that does not encode a portion of the protein sequence. Thus, such sequences are transcribed into RNA but are then excised and are not translated. The term is also used for the excised RNA sequences. An “exon” is a portion of the sequence of a gene that is transcribed and is found in the mature messenger RNA derived from the gene, but is not necessarily a part of the sequence that encodes the final gene product.

[0082] The “translation leader sequence” refers to a DNA sequence located between the promoter sequence of a gene and the coding sequence. The translation leader sequence is present in the fully processed mRNA upstream of the translation start sequence. The translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency. Examples of translation leader sequences have been described (Turner, R. and Foster, G. D. (1995) Molecular Biotechnology 3:225).

[0083] The “3′ non-coding sequences” refer to DNA sequences located downstream of a coding sequence and include polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3′ end of the mRNA precursor. The use of different 3′ non-coding sequences is exemplified by Ingelbrecht et al., (1989) Plant Cell 1:671-680.

[0084] “RNA transcript” refers to the product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA sequence derived from post-transcriptional processing of the primary transcript and is referred to as the mature RNA. “Messenger RNA (mRNA)” refers to the RNA that is without introns and that can be translated into protein by the cell. “cDNA” refers to a DNA that is complementary to and synthesized from a mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into the double-stranded form using the Klenow fragment of DNA polymerase 1. “Sense” RNA refers to RNA transcript that includes the mRNA and can be translated into protein within a cell or in vitro. “Antisense RNA” refers to an RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks the expression of a target gene (U.S. Pat. No. 5,107,065). The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5′ non-coding sequence, 3′ non-coding sequence, introns, or the coding sequence. “Functional RNA” refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes. The terms “complement” and “reverse complement” are used interchangeably herein with respect to mRNA transcripts, and are meant to define the antisense RNA of the message.

[0085] The term “operably linked” refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., that the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in sense or antisense orientation.

[0086] “Transformation” refers to the transfer of a nucleic acid fragment into the genome of a host organism. Host organisms containing the transformed nucleic acid fragments are referred to as “transgenic” organisms. “Stable transformation” refers to the transfer of a nucleic acid fragment into a genome of a host organism, including both nuclear and organellar genomes, resulting in genetically stable inheritance. In contrast, “transient transformation” refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing organelle, of a host organism resulting in gene expression without integration or stable inheritance. The preferred method of cell transformation of plant cells is the use of particle-accelerated or “gene gun” transformation technology (Klein et al., (1987) Nature (London) 327:70-73; U.S. Pat. No. 4,945,050), or an Agrobacterium-mediated method using an appropriate Ti plasmid containing the transgene (Ishida Y. et al., 1996, Nature Biotech. 14:745-750).

[0087] “Insert,” “transfer,” “introduce,” and the like refer to the action of using a nucleic acid fragment in the process of transformation.

[0088] Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter “Sambrook”). “PCR” amplification or “Polymerase Chain Reaction” is a technique for the synthesis of large quantities of specific DNA segments, consists of a series of repetitive cycles (Perkin Elmer Cetus Instruments, Norwalk, Conn.). Typically, the double stranded DNA is heat denatured, the two primers complementary to the 3′ boundaries of the target segment are annealed at low temperature and then extended at an intermediate temperature. One set of these three consecutive steps is referred to as a cycle.

[0089] Invention

[0090] The present invention includes a plant and plant part comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to at least a portion of at least one coding sequence for a plant fructosyltransferase, operably linked to a vacuole targeting sequence, said molecule sufficient to express a protein capable of producing fructan having a degree of polymerization of at least three, in an embryo of said plant, or any progeny thereof, wherein said progeny comprise said molecule.

[0091] In accordance with the present invention, a plant includes and is not limited to a plant, expressing a protein capable of producing fructan having a DP of at least three in the embryo. Such fructan producing plants include dicots and monocots. Dicots include and are not limited to legumes, including soybean, and the like. Monocots include and are not limited to cereals, also known as grasses, including and are not limited to corn and the like, for example.

[0092] Also within the scope of the invention are plant parts obtained from such plants. Plant parts include differentiated and undifferentiated tissues, including but not limited to, embryos, roots, stems, shoots, leaves, pollen, seeds, grains, tumor tissue, and various forms of cells and culture such as and not limited to single cells, protoplasts, embryos, and callus tissue. The plant tissue may be in plant, organ, tissue or cell culture. Grain and seed are used interchangeably herein. In addition, a corn kernel is a grain.

[0093] The term “corn” refers to Zea mays, and is used herein interchangeably with maize. The term “soybean” refers to Glycine max. The term “Jerusalem artichoke” refers to Helianthus tuberosus, Term “guayule” refers to Parthenium argentatum. The term “chicory” refers to Cichorium intybus. The term “tomato” refers to Lycopersicon esculentum.

[0094] In accordance with the present invention, plant sources may be the plant per se. In addition, the plant source of the subject invention includes and is not limited to the seed, grain, plant cells, plant protoplasts, plant cell tissue culture from which plants may be regenerated, plant calli, plant clumps, and plant cells that are intact in plants or parts of plants such as embryos, pollen, flowers, ears, cobs, leaves, husks, stalks, roots, root tips, anthers, silk, and the like.

[0095] As used herein, “embryo” refers to the embryo axis and cotyledons in dicots and the embryo axis and scutellum in monocots. “Embryo specific promoter” refers to a promoter which is expressed throughout the embryo axis, the cotyledons, or the embryo axis and coytledons in dicots and embryo axis, the scutellum, or the embryo axis and scutellum in monocots. Preferred embryo-specific promoters are seed protein promoters, which may be expressed in the cotyledons or the cotyledons and embryo axis.

[0096] To date, a multitude of embryo-specific promoters are known which direct strong seed-specific expression of a transgene or recombinant DNA construct. Examples include, but are not restricted to, the Kunitz trypsin inhibitor (KTi) promoter (Jofuku et al. (1989) Plant Cell 1:1079-1093; Perez-Grau, L. and Goldberg, R. (1989) Plant Cell 1:1095-1109), the Phaseolin promoter (Burow, M. D. et. al. (1992) Plant J. 2:537-548), the promoter of the gene for the α′-subunit of the β-conglycinin (Beachy, R. N. et al. (1985) EMBO J. 4:3047-3053; Harada, J. J. et. al. (1989) Plant Cell 1: 415-425), the soybean 2S-albumin promoter (Coughlan, S. J. and Winfrey, R. J, U.S. Pat. No. 6,177,613 issued January 2001), the soybean Glycinin promoter (Nielsen, N. C. et., al. (1989) Plant Cell 1:313-328), the maize (Zea mays L.) globulin-1 promoter (Belanger F. C. and Kriz A. L.(1991) Genetics 129:863-872), the maize oleosin promoter (Lee, K. and Huang, A. H. (1994) Plant Mol. Biol. 26:1981-1987).

[0097] In accordance with the present invention, “vacuole targeting sequence,” also referred to as vacuole sorting signals (BMBP, page 192) refers to a sequence that after translation directs a gene product, polypeptide, protein, or the like to a vacuole. Vacuole targeting sequences are known in the art and are operably linked to the other parts of the recombinant DNA molecule (BMBP, pages 192-193).

[0098] “Fructosyltransferase” refers to a protein coded for by any one of several genes having the property of producing a carbohydrate polymer consisting of repeating fructose residues. Fructosyltransferases may be isolated from a plant or bacterial source. The repeating fructose residues may be linked by β2-1 linkage, a p2-6 linkage, or any combination of the two types of linkages. The polymer of repeating fructose residues may contain one terminal glucose residue, derived from a sucrose molecule, and at least two fructose residues. The polymer of repeating fructose residues in any form, with any combination of linkages, and with any number of fructose residues, is referred to generally as a “fructan”. Fructosyltransferases include and are not limited to fructose:fructose fructosyltransferase and sucrose:sucrose fructosyltransferase.

[0099] A “fructosyltransferase gene” or “ftf” refers to the polynucleotide coding for a fructosyltransferase protein. “FTF” refers to fructosyltransferase protein or fructosyltransferase protein activity. The term “deleterious effect” as used herein, refers to a direct or indirect injurious effect on a plant or plant cell as a result of fructan accumulation, such that the plant or plant cell is prevented from performing certain functions including, but not limited to, synthesis and transport of carbohydrates within a cell and throughout the plant, regeneration of transgenic plants or tissue, development of the plant or plant cell to maturity, or the ability to pass the desired trait or traits to progeny. For purposes of the present invention, frutosyltransferases and coding sequences therefor may be isolated from plant or bacterial sources. Plants are the preferred source of fructosyltransferase coding sequences. Such plant sources include and are not limited to Jerusalem artichoke and guayule.

[0100] “Fructan” refers to any compound in which one or more fructosyl-fructose linkages constitute a majority of the linkages (the presence of a glucose unit is optional).

[0101] “Fructose” refers to a very sweet sugar, C₆H₁₂O₆, occurring in many fruits and honey and used as a preservative for foodstuffs and as an intravenous nutrient. Fructose is also known as fruit sugar, levulose.

[0102] A “fructosyl unit” refers to a fructose molecule linked to another sugar molecule (e.g. glucose, fructose, galactose, mannose).

[0103] “Inulin” refers to fructan that has mostly β-2,1 fructosyl-fructose linkages (the presence of a glucose unit is optional). “Degree of polymerization” or “DP” refers to the number of fructose residues contained in an individual fructan polymer. DP varies greatly depending on the source from which the fructan is isolated. For purposes of the present invention, a transgenic plant should be capable of producing fructan having a degree of polymerization of at least three. Thus, a plant embryo comprising a coding sequence for fructosyltransferase in accordance with the present invention produces fructan having a degree of polymerization of at least three.

[0104] In accordance with the present invention, a monocot embryo comprising a transgene for sucrose:sucrose fructosyltransferase, as well as a grain (corn kernels for example) containing a monocot embryo, contain fructan exhibiting a degree of polymerization of at least three and includes fructan having a degree of polymerization of at least four, at least five, at least six, and at least seven. A monocot comprising a transgene for sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase, contains in the embryo or grain containing the embryo fructan having a degree of polymerization of at least three, and also includes fructan having degrees of polymerization of at least four, at least five, at least six, at least seven, at least eight, at least nine and at least ten. Fructan having a degree of polymerization of up to about 200 may be obtained from plants.

[0105] A dicot embryo transformed with a coding sequence for sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase produces fructan having a degree of polymerization of at least three, and includes fructan having degrees of polymerization of four and five.

[0106] It is expected that a dicot embryo transformed with a coding sequence for sucrose:sucrose fructosyltransferase will also produce fructan with a degree of polymerization of at least three. In the same manner, progeny thereof are expected to produce fructan having a degree of polymerization of at least three.

[0107] In accordance with the present invention, the plant cell may be transformed by at least one recombinant DNA molecule that results in production of fructan having a degree of polymerization of at least three. Such recombinant DNA molecule includes and is not limited to a recombinant DNA molecule encoding at least a portion of a coding sequence for a plant fructosyltransferase, wherein the fructosyltransferase is sucrose:sucrose fructosyltransferase, and a recombinant DNA molecule encoding at least a portion of a coding sequence for a eukaryotic fructosyltransferase, wherein the fructosyltransferase is sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase.

[0108] Also included as recombinant DNA molecule is a recombinant DNA molecule comprising sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase. Yet another recombinant DNA molecule is a first recombinant DNA molecule and a second recombinant DNA molecule, wherein the first DNA molecule comprises a coding sequence for sucrose:sucrose fructosyltransferase and the second DNA molecule comprises a coding sequence for fructose:fructose fructosyltransferase.

[0109] The transformed plant is then grown under conditions suitable for the expression of the recombinant DNA molecule. Expression of the recombinant DNA molecule results in fructan having a degree of polymerization of at least three.

[0110] The present invention is also directed to a method of producing fructan in a plant comprising constructing at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least one coding sequence for a fructosyltransferase, transforming a plant with the construct, regenerating the plant to produce seed, harvesting seed from the plant, and extracting fructan from the harvested seed.

[0111] The regenerated plant may be multiplied to obtain a useful amount of seed that may be employed in large scale growth, such as farming, of crops from which fructan may be obtained. In addition, grain per se, comprising a transgene for a fructosyltransferase, is useful as feed for animals.

[0112] The present invention also includes a method of screening transgenic plant tissue expressing embryo targeted genes comprising preparing Type-II maize callus for transformation, transforming callus, selecting transgenic callus lines, regenerating transgenic somatic embryos, and propagating transgenic somatic embryos for plant production and early trait analyses.

[0113] Another embodiment of the present invention is a foodstuff comprising fructan produced by a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least one coding sequence for a fructosyl-transferase, the molecule sufficient to express fructan of at least DP3 in a grain of the plant, or any progeny thereof, wherein the progeny comprise said molecule. The fructan of such plant may be inulin.

[0114] Industrial products comprising fructan produced in accordance with the present invention are also included herein. Such industrial products include and are not limited to a hydrocolloid, a bleach activator, a dispersing agent, glue, and a biodegradable complexing agent.

[0115] Also, within the scope of this invention are food and beverages which have incorporated therein a fructan product of the invention. The beverage can be in a liquid or a dry powdered form.

[0116] “Foodstuff,” including “food” and “feed,” is used herein to mean substances for consumption that contain fructan or grain from the plant of the present invention. Grain, for example, is useful in such food and feed, for humans and animals, respectively.

[0117] The foods to which fructan of the invention can be incorporated/added include almost all foods/beverages. For example, there can be mentioned meats such as ground meats, emulsified meats, marinated meats, and meats injected with a product of the invention; beverages such as nutritional beverages, sports beverages, protein fortified beverages, juices, milk, milk alternatives, and weight loss beverages; cheeses such as hard and soft cheeses, cream cheese, and cottage cheese; frozen desserts such as ice cream, ice milk, low fat frozen desserts, and non-dairy frozen desserts; yogurts; soups; puddings; bakery products; and salad dressings; and dips and spreads such as mayonnaise and chip dips. Fructan can be added in an amount selected to deliver a desired dose to the consumer of the food and/or beverage.

[0118] Depending on their origin, fructan vary greatly in size and functionality allowing for the use of fructan in a wide variety of commercial applications. Fructan with a low DP have a sweet taste while fructan with a higher DP provide better functionality and a texture similar to fat. The food industry uses fructan as low calorie replacements because the human body is not capable of metabolizing them. Furthermore, the food industry uses fructan to make functional and healthier foods and food additives. This health effect is based on the observation that fructan, which reach the colon intact, are fermented resulting in prebiotic effects towards certain beneficial species of Bifidobacteria and advantageous effects promoting overall health. These health effects include improvement of intestinal microflora, protection against intestinal infections, prevention of constipation, reduction of serum cholesterol, increased mineral absorption, anti-colon-cancer effects and increased production of B-vitamins (Information pamphlet of Imperial-Sensus, Sugar Land, Tex. 77487). The feed industry also takes advantage of animals being incapable of metabolizing fructan. Thus, the addition of fructan to feed enhances animal health and performance through selective fermentation by beneficial organisms such as Bifidibacteria at the expense of pathogenic organisms such as E. coli and Salmonella. This selective fermentation leads to altered fatty acid profiles, increased nutrient absorption, and decreased levels of blood cholesterol in the animal. Fructan is also considered to be an excellent source of fructose for the production of high-fructose syrup. Fructose may be obtained by the hydrolysis of fructan into individual fructose residues. This process for the preparation of fructan has a tremendous advantage over the current, technically demanding, process of enzymatically converting starch into high fructose syrup. Using fructan as the starting material would, therefore, significantly reduce production costs. Fructan with a medium to high DP are useful for industrial applications, such as the production of biodegradable complexing agents for heavy metals, biodegradable glues, filler/binders and surfactants.

[0119] The most commonly used fructan to date is inulin, which is commercially obtained by extraction of plants or plant parts. Inulin is a polydisperse carbohydrate built up of fructose units, with an optional glucose unit, that cannot be digested by the human digestive enzymes and reaches the colon intact. In addition to inducing a health benefit in humans and animals, inulin has some nutritional as well as functional benefits that result in advantageous qualities in food and feed. The nutritional benefits are mainly found in the fact that inulin is a soluble dietary fiber, has a low caloric value, and is suitable for diabetics. The functional benefits of inulin include, in part, its function as a water soluble compound, texturizer, taste improver, good solubility, sugar and fat replacer, fiber enrichment, and use in filler/binder for tablets. Given the inulin benefits mentioned above, inulin has been used in the manufacture of a wide variety of food and feed products as well as drinks and non-food products. Depending on the application, inulins with a different profile are used. Inulins of DP2 to DP7, also referred to as oligofructose, are commonly used as low caloric sweeteners. Low DP inulins as well as inulins with an average DP of 9 are also used as a soluble dietary fiber and as an ingredient in food and feed products emphasizing health benefits. Inulins with an average DP of 10 and average DP of >23 are commercially available (Orafti, Tienen, Belgium) and are mainly used in food and feed products for their functional benefits described above.

[0120] Commercial use of fructan is currently severely limited due to the high cost and low acreage of production. Fructan used in low-calorie foods are currently extracted from chicory (Cichorium intybus) and Jerusalem artichoke (Helianthus tuberosus). Larger polymers synthesized by bacteria are not currently produced on a commercial scale. Chicory and Jerusalem artichoke are cultivated mainly in Europe and using non-economic farming practices. A few crops adapted to wide growing regions, such as oat, wheat, and barley, accumulate fructan and only at extremely low levels.

[0121] The disclosure of each reference set forth in this application is incorporated herein by reference in its entirety.

EXAMPLES

[0122] The present invention is further illustrated in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various uses and conditions. Thus, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.

Example 1 Method For Early Screening of Embryo-Targeted Traits in Transgenic Maize

[0123] A method for early screening of embryo-targeted traits in transgenic maize using transgenic somatic embryos was developed. The method consists of:

[0124] 1) preparing Type-II maize callus for transformation,

[0125] 2) transforming callus using the particle bombardment technique,

[0126] 3) selecting transgenic callus lines,

[0127] 4) regenerating transgenic somatic embryos,

[0128] 5) propagating transgenic somatic embryos for early trait analyses and plant production, and.

[0129] 6) analyzing somatic embryos for phenotypic trait.

[0130] 1. Preparation of Callus for Transformation

[0131] A rapidly growing Type-II maize callus is transferred to #4 Whatman filter paper placed on a modified Chu (N6) callus maintenance medium (Chu, C. C., et al. (1975) Scientia Sinic. 18:659). The callus is spread in a thin layer covering the filter paper in a circular area of approximately 4 cm in diameter, the filter paper is transferred to a petri dish, and is incubated in the dark in a growth chamber (45% humidity, 27-28° C.) for two to four days before transformation via gold particle bombardment. On the day of bombardment, the callus-containing filter is transferred to a petri dish containing modified Chu (N6) high osmoticum medium, wrapped with parafilm, and placed in the dark growth chamber for four additional hours. Just prior to bombardment, the petri dishes are left partially ajar for thirty minutes in the laminar flow hood to allow moisture on the tissue to dissipate.

[0132] 2. DNA/Gold Preparation and Particle Bombardment Procedure

[0133] DNA is precipitated onto gold particles and the corn callus is bombarded with DNA/gold according to the method of Fromm et al. (Fromm et al. (1990) Biotechnology (NY) 8:833-839).

[0134] 3. Selection of Transgenic Maize Callus Lines

[0135] Transgenic maize callus lines are selected by transferring the filter paper containing the callus through different media as follows:

[0136] Transfer 1: Within 60 minutes following bombardment, callus-containing filter papers are placed onto fresh callus maintenance medium, wrapped with parafilm, and incubated in the dark chamber for 3-4 days.

[0137] Transfer 2: After 3-4 days, plates containing filters with bombarded callus are checked for contamination and 3-4 mm clumps of callus are subcultured onto selection medium which is a modified Chu (N6) medium supplemented with 2-10 ppm bialaphos. Plates containing the newly subcultured callus on selection medium are wrapped with parafilm and incubated in the dark.

[0138] Transfer 3: After about 7-14 days (depending on growth rate) larger clumps are split into several smaller pieces, keeping track of all pieces originating from each original clump, and subcultured onto fresh selection medium, as above.

[0139] Transfer 4: After another ˜14 days all callus are transferred onto fresh selection medium containing bialaphos, keeping track of the lineage of each piece as above. If needed, clumps may again be split into several pieces at this transfer.

[0140] Transfer 5: After 2 or 3 weeks, callus may be transferred onto fresh selection medium, keeping track of unique lines as above. This depends on the growth of the tissue and the experiment. Approximately 2-3 weeks after transfers 4 or 5, bialaphos-tolerant, rapidly-growing callii (transformation events) are identified and individually subcultured onto fresh selection. Callii are incubated in this medium for another two-weeks.

[0141] 4. Regeneration of Transgenic Somatic Embryos

[0142] Transgenic callus events are isolated onto plates of fresh selection medium, one to four independent callus events per plate. After two weeks, each event is assigned a number, sampled for PCR analysis, placed in an individual plate containing a modified MS medium (Murashige, T. and Skoog, F. (1962) Physiol. Plant. 15:473), and grown in the dark for 10-14 days. This step is the first stage of regeneration to plants through somatic embryogenesis. During this time, the embryogenic callus grows to form many discrete, hard, white somatic embryos.

[0143] 5. Propagation of Somatic Embryos for analysis and Regeneration of Transgenic Plants

[0144] After 10-14 days in the dark on first-stage regeneration medium, some of the hard, white somatic embryos are used for analyses and at the same time some are regenerated into plants. For analysis the somatic embryos may be transferred to empty plastic sample dishes and analyzed immediately, may be transferred to empty plastic sample dishes and frozen immediately at −78° C. until analyzed, or may be transferred onto second-stage regeneration medium (a modified MS medium, in which the concentration of MS salts is reduced to one-half the concentration normally used (Murashige, T. and Skoog, F. (1962) Physiol. Plant 15: 473) for transport and later analysis. For regeneration into plants, the hard, white somatic embryos are transferred onto second-stage regeneration medium and placed in the light at 26° C.

[0145] 6. Analysis of Somatic Embryos for Transgenic Phenotypic Trait

[0146] Somatic embryos are ground to homogeneity and analyzed for phenotypic trait such as protein, oils, carbohydrates (such as in Examples 5 and 9), isoflavones, flavones, etc.

Example 2 Construction of a Cassette for Embryo-Targeted Expression of Jerusalem Artichoke SST in Transgenic Zea mays L.

[0147] A cassette designed for the embryo-specific expression in maize of the Jerusalem artichoke sucrose:sucrose fructosyltransferase (SST) was assembled. This cassette, GLOBSST01(f) is shown in FIG. 1 contains a maize embryo-specific globulin promoter directing translation of the entire Jerusalem artichoke SST coding region followed by a 3′ nos termination signal.

[0148] The GLOBSST01(f) cassette was assembled into plasmid vector pGLOBSST01 by replacing the maize endosperm-specific 10 kD zein promoter in plasmid 10 kD-SST-17 with the maize embryo-specific globulin promoter. Plasmid 10 kD-SST-17 (described in PCT publication No. WO99/46395, published Sep. 16, 1999) contains the 10 kD zein promoter directing the expression of the Jerusalem artichoke SST, including native and secretory vacuolar signals. To assemble plasmid 10 kD-SST-17 an intermediary plasmid was assembled by removing the polynucleotide fragment encoding SacB from plasmid pCyt-SacB (described by Caimi et al. (1996) Plant Physiol. 110:355-363) by digesting with Nco I and Hind III and inserting the polynucleotide fragment encoding the Jerusalem artichoke SST that had been removed from plasmid pSST403 (described in PCT publication WO 96/21023, published Jul. 11, 1996) by digestion with Nco I and Hind III. The polynucleotide fragment comprising the 10 kD zein promoter and SST coding region was removed from this intermediary plasmid by digestion with Sma I and Bam HI. The 10 kD-SST fragment was then inserted into Sma I and Bam HI-digested plasmid pKS17 to form plasmid 10 kD-SST-17. Plasmid pKS17 was derived from the commercially-available plasmid pSP72 (Promega Biotech, Madison, Wis.) by deleting from pSP72 the polynucleotide fragment corresponding to the beta lactamase coding region (nucleotides 1135 through 1995) and inserting between the E. coli T7 RNA polymerase promoter and termination signal a polynucleotide fragment encoding HPT. The polynucleotide fragment encoding HPT corresponds to the polynucleotide fragment from E. coli strain W677 encoding hygromycin B phosphotransferase which, when under the control of a bacterial promoter, allows for selection of transformed cells in certain bacteria (Gritz, L. and Davies, J. (1983) Gene 25:179-188). Finally, the embryo-specific globulin promoter described in U.S. Pat. No. 5,773,691 was used to replace the 10 kD zein endosperm-specific promoter in plasmid 10 kD-SST-17. To do this, first, an Nco I restriction endonuclease site present in the globulin promoter in plasmid pCC50 was destroyed to form plasmid pBT747. Then, the polynucleotide fragment containing the sequences for the globulin promoter were removed from plasmid pBT747 by digestion with Sal I and Nco I and the fragment containing the globulin promoter was used to replace the 10 kD zein promoter in plasmid 10 kD-SST-17 to create plasmid pGLOBSST01. The sequence of plasmid pGLOBSST01 is shown in SEQ ID NO:1.

[0149] Digestion of pGLOBSST01 with Hind III yields a 3378 bp DNA fragment containing the SST coding region surrounded by the embryo-specific globulin promoter and the nos 3′ transcription termination region. This fragment was designated GLOBSST01(f), is shown in FIG. 1, and contains the complete embryo-specific SST expression cassette. The GLOBSST01(f) DNA fragment was purified by gel electrophoresis and was used for transformation into corn by particle bombardment as described below.

Example 3 Construction of a Cassette for Embryo-Targeted Expression of Jerusalem Artichoke FFT in Transgenic Zea mays L.

[0150] A cassette designed for the embryo-specific expression in maize of the Jerusalem artichoke fructan:fructan fructosyltransferase (FFT) was assembled. This cassette, GLOBFFT01(f), contains a maize embryo-specific globulin promoter directing translation of the entire Jerusalem artichoke FFT coding region followed by a 3′ nos termination signal.

[0151] The GLOBFFT01(f) cassette was assembled into plasmid pGLOBFFT01 by replacing the maize endosperm-specific 10 kD zein promoter in plasmid 10 kD-FFT-17 with the maize embryo-specific globulin promoter. Plasmid 10 kD-FFT-17 (described in PCT publication No. WO99/46395, published Sep. 16, 1999) contains the 10 kD zein promoter directing the expression of the Jerusalem artichoke FFT, including native and secretory vacuolar signals. To assemble plasmid 10 kD-FFT-17 an intermediary plasmid was constructed by removing the polynucleotide fragment encoding SacB from plasmid pCyt-SacB (described by Caimi et al. (1996) Plant Physiol. 110:355-363) by digestion with Nco I and Bam HI and replacing this fragment with the polynucleotide fragment encoding the Jerusalem artichoke FFT from plasmid pSST405 (described in PCT publication WO 96/21023, published Jul. 11, 1996). The polynucleotide fragment containing the 10 kD zein promoter and the FFT coding region was removed from this intermediary plasmid by digestion with Sma I and Sal I. The 10 kD-FFT fragment was inserted into plasmid pKS17 (described in Example 2) that had been digested with Sma I and Bam HI to form plasmid 10 kD-FFT-17. Finally, the embryo-specific globulin promoter was removed from plasmid pBT747 (described in Example 3) by digesting with Sma I and Nco I and used to replace the 10 kD zein endosperm-specific promoter in plasmid 10 kD-FFT-17 to create plasmid pGLOBFFT01. The sequence of plasmid pGLOBFFT01 is shown in SEQ ID NO:2.

[0152] Digestion of pGLOBFFT01 with Hind III yields a 3344 bp DNA fragment, containing the FFT coding region surrounded by the embryo-specific globulin promoter and the nos 3′end. This fragment was designated GLOBFFT01(f), is depicted in FIG. 2, and contains the complete embryo-specific FFT expression cassette. This fragment was purified by gel electrophoresis and was used for transformation into corn by particle bombardment as described below.

Example 4 Transformation of Corn Embryos with Embryo-Targeted SST. FFT, or Both and Detection of the Transgenes

[0153] Transformation of Corn Embryogenic Calli

[0154] For corn embryogenic calli transformation, the purified DNA fragments containing the embryo-specific cassettes were co-bombarded with pDetric, a polynucleotide fragment containing the bar gene under the control of the CaMV 35S promoter and OCS 3′-end. The bar gene (Murakami et al. (1986) Mol. Gen. Genet 205:42-50; DeBlock et al. (1987) EMBO J. 6:2513-2518) encodes phosphinothricin acetyl transferase (PAT) which confers resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin (bialophos). The sequence of the Hind III polynucleotide fragment corresponding to pDETRIC is shown in SEQ ID NO:3. Other selectable markers may be used in the invention such as, and not limited to, pALSLUC (Fromm, et al, (1990) Biotechnology 8:833-839) that contains polynucleotides encoding a mutant acetolactate synthase (ALS) that confers resistance to chlorsulfuron under the control of the CaMV 35S promoter.

[0155] Embryogenic maize callus derived from crosses of the inbred lines Al 88 and B73 (Armstrong et al.(1991) Maize Genetics Cooperation Newsletter 65:92-93) were co-transformed with pDetric and pGLOBSST01(f), or with pDetric, pGLOBSST01(f), and pGLOBFFT01(f) using microprojectile bombardment (Klein T. M. et. al. (1987) Nature 327:70-73).

[0156] Transformed embryogenic cells were recovered on medium containing glufosinate-ammonium. Transgenic embryos selected as in Example 1 were analyzed for production of fructan or transferred to 12 inch pots containing METROMIX™ soil (Scotts-Sierra Company, Marysville, Ohio) and grown to maturity in the greenhouse

[0157] Detection of SST and FFT Transgenes

[0158] The presence of the SST and FFT in transgenic embryos or plants was accomplished by PCR analyses of RNA obtained from leaf tissue.

[0159] Oligonucleotide primers SST-1 (SEQ ID NO:4) and SST-2 (SEQ ID NO:5) were used to detect the polynucleotide fragment encoding Jerusalem artichoke SST. Oligonucleotide primers FFT-1 (SEQ ID NO:6) and FFT-2 (SEQ ID NO:7) were used to detect the polynucleotide fragment encoding Jerusalem artichoke FFT. SST-1: 5′- TTCGTAACTCAGTTGCCAAATATTG-3′ (SEQ ID NO:4) SST-2: 5′- CCAGCCCGTTTGTGTGTACGGT-3′ (SEQ ID NO:5) FFT-1: 5′- GTTCGTATCGTCACCAATTCG-3′ (SEQ ID NO:6) FFT-2: 5′- GTGCACTATCATTGGTTAACG-3′ (SEQ ID NO:7)

[0160] After amplification and separation of the DNA fragments by polyacrylamide gel electrophoresis, transgenic maize somatic embryos or transgenic plants were identified that contained only the Jerusalem artichoke SST, only the Jerusalem artichoke FFT, or both, the Jerusalem artichoke SST and the Jerusalem artichoke FFT.

Example 5 Corn Fructan Composition Analyses

[0161] The carbohydrate composition of transgenic somatic embryos or transgenic plants identified in Example 4 as containing the GLOBFFT01(f) and/or GLOBSST01(f) cassettes was measured by high performance anion exchange chromatography/pulsed amperometric detection (HPAE/PAD). Individual seeds from transgenic lines were harvested at 35-50 days post-pollination (DPP) for detection of carbohydrate composition. The seeds were frozen in liquid nitrogen, ground with a mortar and pestle, and transferred to 15 mL microcentrifuge tubes. Fresh individual somatic embryos were rapidly washed in water, dried on a paper towel, and transferred into 1.5 mL microcentrifuge tubes. Ethanol (80%) was added to the tubes and the samples were heated to 70° C. for 15 minutes. The samples in the 15 mL tubes were centrifuged at 4, 000 rpm and the samples in the 1.5 mL tubes were centrifuged at 14,000 rpm for 5 minutes at 4° C. and the supernatant collected. The pellet was re-extracted two additional times with 80% ethanol at 70° C. The supernatants were combined, dried down in a speedvac, and the pellet re-suspended in water.

[0162] For HPAE analysis, the extracts were filtered through a 0.2 μm Nylon-66 filter (Rainin, Emeryville, Calif.) and analyzed by HPAE/PAD using a DX500 anion exchange analyzer (Dionex, Sunnyvale, Calif.) equipped with a 250×4 mm CarboPac PA1 anion exchange column and a 25×4 mm CarboPac PA guard column. Soluble carbohydrates and inulin were separated with a 30 minute linear gradient of 0.5 to 170 mM NaAc in 150 mM NaOH at a flow rate of 1.0 mL/min. A mixture of 20 mg/L of glucose, fructose, sucrose, raffinose, stachyose, 1-kestose (DP3), 1-kestotetraose (DP4), and 1-kestopentaose (DP5, Megazyme, Bray, Ireland) was used as a standard.

[0163] Soluble sugars as well as 1-kestose, 1-kestotetraose, and 1-kestopentaose were quantified by comparison to standards using HPAE/PAD. To quantify inulin, the fructan molecules were hydrolyzed with 150 mM HCl and incubated at 60° C. for up to 60 minutes. This solution was neutralized by addition of NaOH and the released fructose was quantified using HPAE/PAD. In the present application fructan is expressed in μmol hexose equivalent/g fresh weight (μmol/g f w).

[0164] Carbohydrate Analysis of Transgenic Maize Somatic Embryos

[0165] A carbohydrate profile resulting from HPAE/PAD analysis of maize somatic embryos not expressing the GLOBSST01(f) or GLOBFFT01(f) cassettes is shown in FIG. 3. A carbohydrate profile resulting from HPAE/PAD analysis of transgenic maize somatic embryos expressing intact copies of the GLOBSST01(f) cassette is shown in FIG. 4, and resulting from transgenic maize somatic embryos expressing GLOBSST01(f) and GLOBFFT01(f) cassettes is shown in FIG. 5.

[0166] The carbohydrate profile in FIG. 3 shows that inulin is not detected in maize somatic embryos not expressing the GLOBSST01(f) or GLOBSST01(f) cassettes. The carbohydrate profile in FIG. 4 shows that transgenic maize somatic embryos expressing the GLOBSST01(f) cassette accumulated inulin-type fructose polymers of DP3 and DP4 and in FIG. 5 shows that transgenic maize somatic embryos expressing both, GLOBSST01(f) and GLOBFFT01(f), cassettes accumulated inulin-type fructose polymers of DP3 through DP7.

[0167] Inulin-accumulating embryos were allowed to develop into plants using standard tissue culture techniques.

[0168] Carbohydrate Analysis of Transgenic Maize Seeds

[0169] Individual mature kernels were obtained from transgenic plants of all events. The kernels were frozen in liquid nitrogen and processed for analyses as indicated above. The carbohydrate profile resulting from HPAE/PAD analysis of kernels from transgenic maize plants not containing GLOBSST01(f) or GLOBFFT01(f) cassettes are shown in FIG. 6. Carbohydrate profiles of kernels from transgenic maize plants containing intact copies of the GLOBSST01(f) cassette are shown in FIG. 7 and of kernels from transgenic maize plants expressing, both, GLOBSST01(f) and GLOBSST01(f), cassettes are shown in FIG. 8.

[0170] No inulin was detected in kernels from transgenic plants not containing the GLOBSST01(f) or the GLOBFFT1(f) cassettes (FIG. 6). Kernels from transgenic plants expressing the GLOBSST01(f) cassette accumulated inulin-type fructose polymers of DP3 through DP7 (FIG. 7). Kernels from transgenic plants expressing both, GLOBSST01(f) and GLOBFFT01(f), cassettes accumulated inulin-type fructose polymers of DP3 through DP10 (FIG. 8). All events that accumulated inulin-type fructose polymers at the somatic embryo stage also accumulated inulin-type fructose polymers at the mature seed stage.

[0171] The results shown in FIGS. 3 through 8 indicate two things. First, that fructan may be produced in maize somatic embryos and that these embryos develop into maize plants that produce kernels that make fructan. Second, that following the method of Example 1 a phenotypic kernel trait may be screened at the maize somatic embryo stage and the same trait will be detected in seed from the mature plant. Therefore, the method of Example 1 provides a powerful screening tool for selecting positive transformants at a very early stage. The ability to screen early and obtain the same results as with mature plants results in major labor, financial, and time savings as a substantially less amount of somatic embryos need to be regenerated into plants, as well as less plants need to be maintained for seed production.

[0172] Table 1 lists a summary of the results obtained from transforming potato, corn, or mutant corn with SST and FFT, and compares the results according to the tissue analyzed and the expression pattern of the transgene. TABLE 1 Accumulation of Inulin-type Fructose Polymers in Transgenic Plants Plant Species Tissue Gene Expression Inulin (μmol/g f w) Potato tuber tuber  37.43¹ Corn² Seed endosperm  22.39 Corn seed embryo  80.02 Corn embryo embryo 800.2³

[0173] Transgenic kernels expressing the GLOBSST01(f) and GLOBFFT01(f) cassettes accumulated up to 80.02 μmol/g fresh weight fructan. Since the corn embryo alone accounts for 10-20% of the total seed weight, fructan accumulation in the germ can be as high as 800 μmol/g fresh weight. Table 1 shows that the inulin amounts observed in transgenic kernels produced by the method described in this invention is substantially higher than that reported for potato tubers (up to 37.43 μmol/g fresh weight). The differences in inulin production may be due to differences in storage reserve composition and underlying physiology as described in the background section of this invention.

Example 6 Construction of Chimeric Vectors for Embryo-Targeted Expression of the Guayule 1-SST and 1-FFT in transgenic Glycine max

[0174] Vectors designed for the embryo-specific expression in soybean of guayule (Parthenium argentatum) sucrose:sucrose fructosyltransferase (1-SST) and fructan:fructan fructosyl transferase (1-FFT) were assembled. Vectors pJMS02, pRM02 were designed to express guayule SST, vectors pJMS01 and pRM03 were developed to express guayule FFT, and vectors pRM01 and pRM04 were intended to express both, guayule SST and guayule FFT.

[0175] Identification of cDNA Clones Encoding Guayule SST and FFT

[0176] Using a guayule (Parthenium argentatum) stem bark library cDNA clones encoding guayule SST and FFT were identified by conducting BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol. Biol. 215:403-410) searches for similarity to sequences contained in the BLAST “nr” database (comprising all non-redundant GenBank CDS translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and DDBJ databases). The cDNA sequences were analyzed for similarity to all publicly available DNA sequences contained in the “nr” database using the BLASTN algorithm provided by the National Center for Biotechnology Information (NCBI). The DNA sequences were translated in all reading frames and compared for similarity to all publicly available protein sequences contained in the “nr” database using the BLASTX algorithm (Gish and States (1993) Nat. Genet. 3:266-272) provided by the NCBI. For convenience, the P-value (probability) of observing a match of a cDNA sequence to a sequence contained in the searched databases merely by chance as calculated by BLAST are reported herein as “pLog” values, which represent the negative of the logarithm of the reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the cDNA sequence and the BLAST “hit” represent homologous proteins.

[0177] The BLASTX search using the sequences from clone epb3c.pk007.n11 revealed similarity of the polypeptides encoded by the cDNAs to 1-SST from Helianthus tuberosus (NCBI General Identifier No. 3367711) with a pLog higher than 180.00. The BLASTX search using the sequences from clone epb3c.pk007.j9 revealed similarity of the polypeptides encoded by the cDNAs to 1-FFT from Helianthus tuberosus (NCBI General Identifier No. 3367690) with a pLog higher than 180.00.

[0178] Amplification of Polynucleotides Encoding Guayule 1-SST or 1-FFT

[0179] Polynucleotide fragments encoding the guayule 1-SST or 1-FFT in clones epb3c.pk007.n11 and epb3c.pk007.j9 were amplified by standard PCR methods using Pfu Turbo DNA polymerase (Stratagene, La Jolla, Calif.) and the following primer sets. The oligonucleotide primers were designed to add Not I restriction endonuclease sites at each end of the 1-SST and 1-FFT polynucleotide fragments. Amplification of the cDNA insert in clone epb3c.pk007.n11 was accomplished using oligonucleotide primers SST-3 (shown in SEQ ID NO:8) and SST-4 (shown in SEQ ID NO:9). The resulting polynucleotide encodes an entire guayule 1-SST including secretory and vacuolar targeting signals and its sequence is shown in SEQ ID NO:10. (SEQ ID NO:8) 5′-AAGCTTGCGGCCGCGCCATGGCTTCMTCHACCACC-3′ (SEQ ID NO: 9) 5′-AAGCTTCTCGAGGCGGCCGCTCAAGAAGTCCACCCAGTAAC-3′

[0180] Amplification of the polynucleotide encoding the guayule 1-FFT in clone epb3c.pk007.j9 was performed using the oligonucleotide primers FFT-3 (shown in SEQ ID NO:11) and FFT-4 (shown in SEQ ID NO:12). The resulting polynucleotide fragment encodes an entire guayule 1-FFT including secretory and vacuolar targeting signals and its sequence is shown in SEQ ID NO:13. FFT-3: 5′-AAGCTTGCGGCCGCACCATGGCAACCCCTGAACAACCC-3′ (SEQ ID NO:11) FFT-4: 5′-AAGCTTCTCGAGGCGGCCGCCTAATTAAACTCGTATTGATG-3′ (SEQ ID NO:12)

[0181] Assembly of Vectors for the Expression of Guayule 1-SST and 1-FFT

[0182] Preparation of pJMS02: The polynucleotide product obtained from amplification of clone epb3c.pk007.n 11 encoding guayule 1-SST was digested with Not I and assembled into vector pJMS02 (shown in FIG. 9) by the following steps. First, the commercially-available plasmid pSP72 (Promega Biotech, Madison, Wis.) was modified to create plasmid pSP72a. Plasmid pSP72 consisted of deletion of the fragment corresponding to the beta lactamase coding region (nucleotides 1135 through 1995), insertion of a polynucleotide fragment comprising the E. coli RNA polymerase T7 promoter operably linked to a polynucleotide encoding HPT the E. coli RNA polymerase T7 promoter and transcription termination, and inserting polynucleotide fragments for the KTi3 promoter and KTi3 transcription termination region. HPT and its function under the control of a bacterial promoter is explained in Example 2. The KTi3 promoter and 3′ transcription terminator region have been described by Jofuku et al. [(1989) Plant Cell 1:1079-1093]. The KTI3 promoter directs strong embryo-specific expression of transgenes. Then, the isolated DNA fragment containing the guayule SST was inserted into Not 1-digested plasmid pSP72a to obtain plasmid pJMS02 the sequence of which is shown in SEQ ID NO:14.

[0183] Preparation of pRM03: Vector pRM03 comprises nucleotides encoding guayule FFT under the control of a KTi3 promoter and termination signals and nucleotides encoding HPT under control of the T7 promoter and termination signals. To produce vector pRM03 the polynucleotide product encoding guayule 1-FFT obtained from amplification of clone epb3c.pk007.j9 was digested with Not I and used to replace the 1-SST polynucleotide fragment from clone pJMS02 to create plasmid pRM03. The 1-SST polynucleotide fragment had been removed from pJMS02 by digestion with Not I. Vector pRM03 is depicted in FIG. 10 and contains two expression cassettes. One cassette contains the KTi3 promoter directing the expression of the guayule 1-FFT (including secretory and vacuolar targeting signals) followed by the KTi3 transcription terminator. Another cassette comprises the E. coli RNA polymerase T7 promoter directing the expression of HPT followed by the T7 transcription terminator. The polynucleotide sequence of vector pRM03 is shown in SEQ ID NO:15.

[0184] Preparation of pJMS11: Vector pJMS01 comprises nucleotides encoding guayule 1-FFT under the control of the beta conglycinin promoter and phaseolin terminator. This vector also comprises nucleotides encoding HPT under the control of the T7 promoter and termination signals and the 35S promoter and Nos 3′ terminator. To produce vector pJMS01 the polynucleotide product encoding guayule 1-FFT obtained from amplification of clone epb3c.pk007.j9 was digested with Not I and inserted into Not I-digested soybean expression vector pKS123 to generate the vector pJMS01 (depicted in FIG. 11). Vector pKS123 contains a cassette for the expression of HPT under the CaMV 35S promoter and nos 3′ end and a cassette comprising a β-conglycinin promoter and the phaseolin 3′ transcription terminator separated by a Not I restriction endonuclease site. To prepare vector pKS123, a cassette comprising a CaMV 35S promoter directing the expression of HPT followed by a nos 3′ end, and flanked on either side with Sal I sites was introduced into vector pKS17 (described in Example 2). This modified vector pKS17 was digested with Xho I and Sal I followed by treatment with mung bean nuclease (to make blunt the resulting ends) and a linker primer introduced. The sequence of this linker primer is shown in SEQ ID NO:16 and contains, in a 5′ to 3′ orientation processing sites for the restriction endonucleases Asc I, Hind III, Bam HI, Sal I, and Asc I.

[0185] 5′-GGCGCGCCAAGCTTGGATCCGTCGACGGCGCGCC-3′ (SEQ ID NO:16)

[0186] After ligation the modified vector was digested with Hind III and the cassette comprising the β-conglycinin promoter and phaseolin 3′ transcription terminator separated by a Not I restriction endonuclease site was added to form vector KS123. The CaMV 35S promoter has been described by Odell et al. ((1985) Nature 313:810-812; and Hull et al. (1987) Virology 86:482-493). The nopaline synthase transcription terminator has been described by Depicker et. al. ((1982) J. Appl. Genet. 1:561-574). The β-conglycinin promoter fragment is an allele of the β-conglycinin promoter published by Doyle et al. ((1986) J. Biol. Chem. 261:9228-9238). The 1175 base pair phaseolin transcription terminator has been described by Doyle et al. ((1986) J. Biol. Chem. 261:9228-9238; and Slightom et al. (1983) Proc. Natl. Acad. Sci. USA 80:1897-1901). The amplified guayule 1-FFT was digested with Not I and introduced into Not 1-digested vector pKS123 to form plasmid pJMS01. Plasmid pJMS01 contains then, β-conglycinin promoter operably linked to the guayule 1-FFT, operably linked to the phaseolin transcription terminator. Plasmid pJMS01 also contains fragments for the expression of HPT in bacteria (under the control of the T7 promoter) and in eukaryotic systems (under the control of the CaMV 35S promoter). These two cassettes allow for selection of transformed cells in bacterial and plant systems in the presence of hygromycin. The nucleotide sequence of plasmid pJMS01 is shown in SEQ ID NO:17.

[0187] Preparation of pRM02: Vector pRM02 (shown in FIG. 12) comprises nucleotides encoding guayule SST under the control of the beta conglycinin promoter and phaseolin 3′ terminator and nucleotides encoding HPT under was prepared by replacing the polynucleotide fragment encoding guayule 1-FFT from plasmid pJMS01 with the polynucleotide fragment encoding guayule 1-SST in plasmid pJMS02. Removal of the guayule 1-FFT and 1-SST fragments was accomplished by digestion with Not I. Plasmid pRM02 is depicted in FIG. 12 and contains the β-conglycinin promoter operably linked to the guayule 1-SST, operably linked to the phaseolin transcription terminator. Plasmid pRM02 also contains cassettes for the expression of HpT in bacterial and plant systems useful for selection of transformed cells. The nucleotide sequence of plasmid pRM02 is shown in SEQ ID NO:18.

[0188] Preparation of pRM01: Vector pRM01 comprises nucleotides encoding guayle 1-SST under control of the KTi3 promoter and termination signals and nucleotides encoding guayule 1-FFT under control of the beta conglycinin promoter and phaseolin terminator. Vector pRM01 also comprises nucleotides encoding HPT under the control of the 35S promoter and nos terminator and T7 promoter and terminator. Plasmid pRM01 (shown in FIG. 13) was constructed by removing the polynucleotide fragment containing the KTi3 promoter/guayule 1-SST coding region/KTi transcription terminator cassette from plasmid pJMS02 and transferring it to plasmid pJMS01. The KTi3 promoter/guayule 1-SST coding region/KTi transcription terminator cassette was removed from plasmid pJMS02 by digestion with Bam HI, which cuts immediately upstream of the KTi3 promoter, and Sal I, which cuts in the KTi3 transcription terminator region. Plasmid pJMS01 was digested with Bam HI and Sal I, which cut between the T7 and phaseolin terminator regions. Plasmid pRM01 contains the polynucleotide encoding guayule 1-SST under the control of the KTi3 promoter and transcription terminator and the polynucleotide encoding guayule 1-FFT under the control of the phaseolin promoter and transcription terminator. Plasmid pRM01 also contains cassettes for the expression of HPT in bacterial and plant systems useful for selection of transformed cells. The polynucleotide sequence of plasmid pRM01 is shown in SEQ ID NO:19.

[0189] Preparation of pRM04: Vector pRM04 comprises nucleotides encoding guayule 1-SST under the control of the beta conglycinin promoter and phaseolin terminator and nucleotides encoding guayule 1-FFT under control of the KT13 promoter and terminator. Vector pRM04 also comprises nucleotides encoding HPT under the control of the T7 promoter and termination signals and the 35S promoter and nos terminator. Plasmid pRM04 (shown in FIG. 14) was constructed by transferring the KTi3 promoter/guayule 1-FFT coding region/KTi3 transcription terminator cassette from pasmid pRM03 to plasmid pRM02. Digestion with Bam HI and Sal I was used to remove the 1-FFT expression cassette from plasmid pRM03 and insert it between the T7 and the phaseolin transcription terminators. Plasmid pRM04 contains the polynucleotide encoding guayule 1-SST under control of the beta conglycinin promoter and phaseolin transcription terminator and the polynucleotide encoding guayule 1-FFT under control of the KTi3 promoter and transcription terminator. Plasmid pRM04 also contains cassettes for the expression of HPT in bacterial and plant systems useful for selection of transformed cells. The polynucleotide sequence of plasmid pRM04 is shown in SEQ ID NO:20.

Example 7 Transformation of Soybean Somatic Embryos with Guayule SST and FFT Expression Vectors

[0190] To study the possibility of producing inulin in soybeans, soybean somatic embryos were transformed with the seed-specific expression vectors expressing the guayule 1-SST and 1-FFT. Soybean somatic embryos were transformed with plasmids pJMS01 and pJMS02, plasmids pRM02 and pRM03, plasmid pRM01, or plasmid pRM04 by the method of particle gun bombardment (Klein, T. M. et al. (1987) Nature (London) 327:70-73; U.S. Pat. No. 4,945,050).

[0191] Soybean somatic embryos from the Jack cultivar were induced as follows. Cotyledons (3 mm in length) were dissected from surface sterilized, immature seeds and were cultured for an additional 6-10 weeks in the light at 26° C. on a Murashige and Skoog media containing 7 g/L agar and supplemented with 10 mg/mL 2,4-D. Globular stage somatic embryos, which produced secondary embryos, were then excised and placed into flasks containing liquid MS medium supplemented with 2,4-D (10 mg/mL) and cultured in the light on a rotary shaker. After repeated selection for clusters of somatic embryos that multiplied as early, globular staged embryos, the soybean embryogenic suspension cultures were maintained in 35 mL liquid media on a rotary shaker, 150 rpm, at 26° C. with fluorescent lights on a 16:8 hour day/night schedule. Cultures were subcultured every two weeks by inoculating approximately 35 mg of tissue into 35 mL of liquid medium.

[0192] Soybean embryogenic suspension cultures were then transformed by the method of particle gun bombardment (Klein, T. M., et al. (1987) Nature (London) 327:70-73, U.S. Pat. No. 4,945,050) using a DuPont Biolistic™ PDS1000/HE instrument (helium retrofit). To 50 μL of a 60 mg/μL 1 mm gold particle suspension were added (in order): 5 μL of 1 mg/μL DNA (pJMS01 plus pJMS02, pRM02 plus pRM03, pRM01, or pRM04), 20 μl of 0.1 M spermidine, and 50 μL of 2.5 M CaCl₂. The particle preparation was then agitated for three minutes, spun in a microfuge for 10 seconds and the supernatant removed. The DNA-coated particles were then washed once in 400 μL 70% ethanol and resuspended in 40 μL of anhydrous ethanol. The DNA/particle suspension was sonicated three times for one second each. Five μL of the DNA-coated gold particles was then loaded on each macro carrier disk.

[0193] Approximately 300-400 mg of a two-week-old suspension culture was placed in an empty 60×15 mm Petri dish and the residual liquid removed from the tissue with a pipette. For each transformation experiment, approximately 5 to 10 plates of tissue were bombarded. Membrane rupture pressure was set at 1100 psi and the chamber was evacuated to a vacuum of 28 inches mercury. The tissue was placed approximately 3.5 inches away from the retaining screen and bombarded three times. Following bombardment, the tissue was divided in half and placed back into liquid and cultured as described above.

[0194] Five to seven days post bombardment, the liquid media was exchanged with fresh media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL hygromycin. This selective media was refreshed weekly. Seven to eight weeks post bombardment, green, transformed tissue was observed growing from untransformed, necrotic embryogenic clusters. Isolated green tissue was removed and inoculated into individual flasks to generate new, clonally propagated, transformed embryogenic suspension cultures. Each new line was treated as an independent transformation event. These suspensions were then subcultured and maintained as clusters of immature embryos. Storage products produced by immature embryos at this stage are similar in composition to storage products produced by zygotic embryos at a similar stage of development (see PCT Publication No. WO 94/11516, published May 26, 1994).

Example 8 Carbohydrate Analysis of Transgenic Soybean Somatic Embryos

[0195] The carbohydrate profiles of soybean somatic embryos resulting from transformation with the vectors containing cassettes for embryo-specific expression of guayule 1-SST and 1-FFT were determined following the protocol shown in Example 5.

[0196]FIG. 15 shows the carbohydrate profile resulting from HPAE/PAD analysis of transgenic soybean somatic embryos expressing guayule 1-SST and 1-FFT. These embryos express the guayule 1-SST and 1-FFT from vectors pRM02 and pRM03, and accumulate inulin of DP3 to DP5. These compounds are absent in soybean somatic embryos that were not transformed with the expression cassettes (negative controls) but were generated using the same procedures as described in Example 7 (FIG. 16). Similar profiles were observed for somatic embryos expressing both, guayule 1-SST and 1-FFT, resulting from transformations using pRM01 or RM04.

[0197] The data presented here shows, for the first time, that transgenic soybean embryos are capable of producing inulin when expressing 1-SST and 1-FFT. The polynucleotides used in the present application to transform soybean encode 1-SST and 1-FFT enzymes that are normally expressed in the bark tissue of the rubber tree. The Jerusalem artichoke SST and FFT, used here to transform corn, are expressed in the tuber.

[0198] Corn kernels accumulate longer inulin than maize somatic embryos, as shown in Example 5, it is expected that soybean seeds will produce more and longer inulin than the somatic embryos.

[0199] In the examples above, it is demonstrated that embryo-specific expression of 1-SST and 1-FFT results in inulin accumulation in plants that normally do not accumulate this carbohydrate. Furthermore, that embryo-specific expression of Jerusalem artichoke 1-SST and 1-FFT results in larger accumulation of inulin than the previously shown endosperm-specific expression of the same 1-SST and 1-FFT. It is also demonstrated that embryo-specific expression of guayule 1-SST and 1-FFT in soybean somatic embryos results production of inulin. Considering that the guayule enzymes are normally expressed in the bark tissue of the rubber tree, expression in the embryo was unexpected. The examples above also show that analysis of somatic embryos for a given trait allows an accurate and quick method for determining successful transformation events.

Example 9 Carbohydrate Analyses of Transgenic Soybean Embryos and Seeds

[0200] Carbohydrate profiles of dried-down transgenic somatic soybean embryos and of transgenic soybean seeds expressing guayule 1-SST and 1-FFT from plasmid pRM01 were obtained. Somatic soybean embryos were transformed with vector pRM01 as described in Example 7. Immature transgenic somatic soybean embryos expressing guayule 1-SST and 1-FFT were selected as in Example 1. The somatic soybean embryos were dried-down to mimic the last stages of soybean seed development. Dried-down embryos are capable of producing plants when transferred to soil or soil-less media.

[0201] Analysis of Dried Transgenic Soybean Embryos

[0202] Immature transgenic somatic soybean embryos expressing guayule 1-SST and 1-FFT were dried-down to mimic the last stages of soybean development especially the seed dry down phase. To dry-down, somatic embryos were transferred to an empty petri dish, covered, and put in a second petri dish containin modified MS medium (described in Example 1) and allowed to dry for 2 to 5 days. The carbohydrate profile of dried-down individual somatic embryos was determined essentially as described in Example 5, with minor modifications. The analysis was modified by using a CarboPac PA100 anion exchange column and guard column which enabled inulin detection of DP>15.

[0203] A typical carbohydrate profile obtained for dried-down soybean embryos expressing guayule 1-SST and 1-FFT from vector pRM01 is shown in FIG. 17. This carbohydrate profile clearly shows that inulin, of DP 3 to at least DP30, is detected in dried-down soybean somatic embryos. No inulin is detectable in dried-down soybean embryos that have gone through the same process but do not express 1-SST or 1-FFT. This is the first time where soybean somatic embryos are shown to produce fructans. These data support the fact that the guayule 1-SST and 1-FFT are expressed and active in soybean.

[0204] Carbohydrate Analysis of Mature Seeds from Transgenic Soybean

[0205] Somatic embryos dried-down as described above were transferred to a soil-less mixture to enable their development into plants. Transgenic plants from all transformation events were allowed to set seed and individual mature seeds were obtained. The carbohydrate profile of mature soybean seeds was determined as described in Example 5. A typical carbohydrate profile of individual seeds from transgenic soybean plants expressing intact copies of the guayule 1-SST and 1-FFT from vector pRM01 is shown in FIG. 18. This Figure shows that transgenic soybean seeds expressing the guayule 1-SST and 1-FFT accumulated inulin-type fructose polymers of DP 3 through at least DP 30. It is possible that accumulation of inulins of DP larger than 30 may still occur but their levels fall below current detection limits. The carbohydrate profile resulting from HPAE/PAD analysis of chips from seeds from transgenic soybean plants not expressing guayule 1-SST and 1-FFT is shown in FIG. 19 where no fructans are detected.

[0206] The results presented above show that fructans may be produced in soybean somatic embryos and that these embryos are capable of developing into soybean plants that produce seeds that make fructans. Furthermore they also show that, as with corn, a phenotypic seed trait may be identified at the soybean somatic embryo stage and the same trait will be present in seed from the mature plants.

1 20 1 3421 DNA Artificial Cassette (1)..(3421) Jerusalem artichoke SST expression cassette 1 gtcgacgata tcggatcctc tagacccggg aagcttgaga caggagataa aagtagaaac 60 tggatacaac actttgtaac atagtgacac tctctccttc ctttctttta ccttagaact 120 atacatagaa tctacattca ataaaaatac agtaggtacg ccgagagatt taaaatgagt 180 aagctaacat accaactaag gccctgtttg tttcggatta taatctctcc agattatata 240 atccagcgta aataattcag cagataaaca aacacctaaa ttatatgttc agattatata 300 atctatagcg gagattatga taatctcgta atctcctaag agtagcttat ttgagatttt 360 tttggcaaaa gacccactac cgttatgtaa atagaattac aatatatgac atccttcttt 420 cttcacctca aataaacaaa caagggtact gttgctttat gaataatcta catttatata 480 atctagacta acaaacaact acatatagat tataatacgt ctagattata atctagatta 540 tataatttaa attatagtcg atattatata atctataagc taaaacaaat agcccctaat 600 tattaggcta ttagttgtta ggctatttaa atctaagcgt aaaacgaact aatagcttat 660 tagttgaatt acaattagct caacggaatt ctctgttttt ctaaaaaaaa actgcccctc 720 tcttacagca aattgtccgc tgcccgtcgt ccagatacaa tgaacgtacc tagtaggaac 780 tcttttacac gctcggtcgc tcgccgcgga tcggagtccc cggaacacga caccactgtg 840 gaacacgaca aagtctgctc agaggcggcc acaccctggc gtgcaccgag ccggagcccg 900 gataagcacg gtaaggagag tacggcggga cgtggcgacc cgtgtgtctg ctgccacgca 960 gccttcctcc acgtagccgc gcggccgcgc cacgtaccag ggcccggcgc tggtataaat 1020 gcgcgccacc tccgctttag ttctgcatac agctagccaa cccaacacac acccgagcat 1080 atcacagtga cagacactac accatggtgg tttcatccac caccaccacc cctctcattc 1140 tccatgatga ccctgaaaac ctcccagaac tcaccggttc tccgacaact cgtcgtctat 1200 ccatcgcaaa agtgctttcg gggatccttg tttcggttct ggttataggt gctcttgttg 1260 ctttaatcaa caaccaaaca tatgaatccc cctcggccac cacattcgta actcagttgc 1320 caaatattga tctgaagcgg gttccaggaa agttggattc gagtgctgag gttgaatggc 1380 aacgatccac ttatcatttt caacccgaca aaaatttcat tagcgatcct gatggcccaa 1440 tgtatcacat gggatggtat catctatttt atcagtacaa ccctcaatct gccatctggg 1500 gaaacatcac atggggccac tcggtatcga aagacatgat caactggttc catctccctt 1560 tcgccatggt tcctgaccat tggtacgaca tcgaaggtgt catgacgggt tcggctacag 1620 tcctccctaa tggtcaaatc atcatgcttt actcgggcaa cgcgtatgat ctctcccaag 1680 tacaatgctt ggcgtacgct gtcaactcgt cggatccact tcttatagag tggaaaaaat 1740 atgaaggtaa ccctgtctta ctcccaccac caggagtagg ctacaaggac tttcgggacc 1800 catccacatt gtggtcgggc cctgatggtg aatatagaat ggtaatgggg tccaagcaca 1860 acgagactat tggctgtgct ttgatttacc ataccactaa ttttacgcat tttgaattga 1920 aagaggaggt gcttcatgca gtcccacata ctggtatgtg ggaatgtgtt gatctttacc 1980 cggtgtccac cgtacacaca aacgggctgg acatggtgga taacgggcca aatgttaagt 2040 acgtgttgaa acaaagtggg gatgaagatc gccatgattg gtatgcaatt ggaagttacg 2100 atatagtgaa tgataagtgg tacccagatg acccggaaaa tgatgtgggt atcggattaa 2160 gatatgattt tggaaaattt tatgcgtcca agacgtttta tgaccaacat aagaagagga 2220 gagtcctttg gggctatgtt ggagaaaccg atccccaaaa gtatgacctt tccaagggat 2280 gggctaacat tttgaatatt ccaaggaccg tcgttttgga cctcgaaact aaaaccaatt 2340 tgattcaatg gccaatcgag gaaaccgaaa accttaggtc gaaaaagtat gatgaattta 2400 aagacgtcga acttcgaccc ggggcactcg ttccccttga gataggcaca gccacacagt 2460 tggatatagt tgcgacattc gaaatcgacc aaaagatgtt ggaatcaacg ctagaggccg 2520 atgttctatt caattgcacg actagtgaag gctcggttgc aaggagtgtg ttgggaccgt 2580 ttggtgtggt ggttctagcc gatgcccagc gctccgaaca acttcctgta tacttctata 2640 tcgcaaaaga tattgatgga acctcacgaa cttatttttg tgccgacgaa acaagatcat 2700 ccaaggatgt aagcgtaggg aaatgggtgt acggaagcag tgttcctgtc ctccctggcg 2760 aaaagtacaa tatgaggtta ttggtggatc attcgatagt agagggattt gcacaaaacg 2820 ggagaaccgt ggtgacatca agagtgtatc caacaaaggc gatctacaac gctgcgaagg 2880 tgtttttgtt caacaacgcg actggaatca gtgtgaaggc gtcgatcaag atctggaaga 2940 tgggggaagc agaactcaat cctttccctc ttcctgggtg gactttcgaa ctttgatggt 3000 tatattttgg accctatata tgtgttatta tcatgatggt tatattttgg accctatata 3060 tgtgttatta tcatgaagca taagtttgga ctggaggggg tattattgta attttatatg 3120 catgttctat tacttgatga tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 3180 attgaatcct gttgccggtc ttgcgatgat tatcatataa tttctgttga attacgttaa 3240 gcatgtaata attaacatgt aatgcatgac gttatttatg agatgggttt ttatgattag 3300 agtcccgcaa ttatacattt aatacgcgat agaaaacaaa atatagcgcg caaactagga 3360 taaattatcg cgcgcggtgt catctatgtt actagatcgg gaattgccaa gcttcagctg 3420 c 3421 2 3245 DNA Artificial Cassette (1)..(3245) Jerusalem Artichoke FFT expression cassette 2 actagtggat cccccgggaa gcttgagaca ggagataaaa gtagaaactg gatacaacac 60 tttgtaacat agtgacactc tctccttcct ttcttttacc ttagaactat acatagaatc 120 tacattcaat aaaaatacag taggtacgcc gagagattta aaatgagtaa gctaacatac 180 caactaaggc cctgtttgtt tcggattata atctctccag attatataat ccagcgtaaa 240 taattcagca gataaacaaa cacctaaatt atatgttcag attatataat ctatagcgga 300 gattatgata atctcgtaat ctcctaagag tagcttattt gagatttttt tggcaaaaga 360 cccactaccg ttatgtaaat agaattacaa tatatgacat ccttctttct tcacctcaaa 420 taaacaaaca agggtactgt tgctttatga ataatctaca tttatataat ctagactaac 480 aaacaactac atatagatta taatacgtct agattataat ctagattata taatttaaat 540 tatagtcgat attatataat ctataagcta aaacaaatag cccctaatta ttaggctatt 600 agttgttagg ctatttaaat ctaagcgtaa aacgaactaa tagcttatta gttgaattac 660 aattagctca acggaattct ctgtttttct aaaaaaaaac tgcccctctc ttacagcaaa 720 ttgtccgctg cccgtcgtcc agatacaatg aacgtaccta gtaggaactc ttttacacgc 780 tcggtcgctc gccgcggatc ggagtccccg gaacacgaca ccactgtgga acacgacaaa 840 gtctgctcag aggcggccac accctggcgt gcaccgagcc ggagcccgga taagcacggt 900 aaggagagta cggcgggacg tggcgacccg tgtgtctgct gccacgcagc cttcctccac 960 gtagccgcgc ggccgcgcca cgtaccaggg cccggcgctg gtataaatgc gcgccacctc 1020 cgctttagtt ctgcatacag ctagccaacc caacacacac ccgagcatat cacagtgaca 1080 gacactacac catggaaacc cctgaaccct ttacagacct tgaacatgaa ccccacacac 1140 ccctactgga ccaccaccac aacccaccac cacaaaccac cacaaaacct ttgttcacca 1200 gggttgtgtc cggtgtcacc tttgttttat tcttctttgc tttcgctatc gtattcattg 1260 ttctcaacca acagaattct tctgttcgta tcgtcaccaa ttcggagaaa tcttttataa 1320 ggtattcgca gaccgatcgc ttgtcgtggg aacggaccgc ttttcatttt cagcctgcca 1380 agaattttat ttacgatcca gatggtcagt tgtttcacat gggctggtac catatgttct 1440 atcaatacaa cccatacgca ccggtttggg gcaatatgtc atggggtcac tcagtgtcca 1500 aagacatgat caactggtac gagctgccag tcgctatggt cccgaccgaa tggtatgata 1560 tcgagggcgt cttatccggg tctaccacgg tccttccaaa cggtcagatc tttgcattgt 1620 atactgggaa cgctaatgat ttttcccaat tacaatgcaa agctgtaccc gtaaacttat 1680 ctgacccgct tcttattgag tgggtcaagt atgaggataa cccaatcctg tacactccac 1740 cagggattgg gttaaaggac tatcgggacc cgtcaacagt ctggacaggt cccgatggaa 1800 agcataggat gatcatggga actaaacgtg gcaatacagg catggtactt gtttactata 1860 ccactgatta cacgaactac gagttgttgg atgagccgtt gcactctgtt cccaacaccg 1920 atatgtggga atgcgtcgac ttttacccgg tttcgttaac caatgatagt gcacttgata 1980 tggcggccta tgggtcgggt atcaaacacg ttattaaaga aagttgggag ggacatggaa 2040 tggattggta ttcaatcggg acatatgacg cgataaatga taaatggact cccgataacc 2100 cggaactaga tgtcggtatc gggttacggt gcgattacgg gaggtttttt gcatcaaaga 2160 gtctttatga cccattgaag aaaaggagga tcacttgggg ttatgttgga gaatcagata 2220 gtgctgatca ggacctctct agaggatggg ctactgttta taatgttgga agaacaattg 2280 tactagatag aaagaccggg acccatttac ttcattggcc cgttgaggaa gtcgagagtt 2340 tgagatacaa cggtcaggag tttaaagaga tcaagctaga gcccggttca atcattccac 2400 tcgacatagg cacggctaca cagttggaca tagttgcaac atttgaggtg gatcaagcag 2460 cgttgaacgc gacaagtgaa accgatgata tttatggttg caccactagc ttaggtgcag 2520 cccaaagggg aagtttggga ccatttggtc ttgcggttct agccgatgga accctttctg 2580 agttaactcc ggtttatttc tatatagcta aaaaggcaga tggaggtgtg tcgacacatt 2640 tttgtaccga taagctaagg tcatcactag attatgatgg ggagagagtg gtgtatgggg 2700 gcactgttcc tgtgttagat gatgaagaac tcacaatgag gctattggtg gatcattcga 2760 tagtggaggg gtttgcgcaa ggaggaagga cggttataac atcaagggcg tatccaacaa 2820 aagcgatata cgaacaagcg aagctgttct tgttcaacaa cgccacaggt acgagtgtga 2880 aagcatctct caagatttgg caaatggctt ctgcaccaat tcatcaatac cctttttaat 2940 taccggctat cgggatcccc gatcgttcaa acatttggca ataaagtttc ttaagattga 3000 atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg 3060 taataattaa catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc 3120 cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat 3180 tatcgcgcgc ggtgtcatct atgttactag atcgggaatt gccaagctta tcgataccgt 3240 cgacc 3245 3 2116 DNA Artificial cassette (1)..(2116) bar gene expression cassette 3 agcttgcatg cctgcaggtc ctgctgagcc tcgacatgtt gtcgcaaaat tcgccctgga 60 cccgcccaac gatttgtcgt cactgtcaag gtttgacctg cacttcattt ggggcccaca 120 tacaccaaaa aaatgctgca taattctcgg ggcagcaagt cggttacccg gccgccgtgc 180 tggaccgggt tgaatggtgc ccgtaacttt cggtagagcg gacggccaat actcaacttc 240 aaggaatctc acccatgcgc gccggcgggg aaccggagtt cccttcagtg aacgttatta 300 gttcgccgct cggtgtgtcg tagatactag cccctggggc cttttgaaat ttgaataaga 360 tttatgtaat cagtctttta ggtttgaccg gttctgccgc tttttttaaa attggatttg 420 taataataaa acgcaattgt ttgttattgt ggcgctctat catagatgtc gctataaacc 480 tattcagcac aatatattgt tttcatttta atattgtaca tataagtagt agggtacaat 540 cagtaaattg aacggagaat attattcata aaaatacgat agtaacgggt gatatattca 600 ttagaatgaa ccgaaaccgg cggtaaggat ctgagctaca catgctcagg ttttttacaa 660 cgtgcacaac agaattgaaa gcaaatatca tgcgatcata ggcgtctcgc atatctcatt 720 aaagcaggac tctagatctc ggtgacgggc aggaccggac ggggcggtac cggcaggctg 780 aagtccagct gccagaaacc cacgtcatgc cagttcccgt gcttgaagcc ggccgcccgc 840 agcatgccgc ggggggcata tccgagcgcc tcgtgcatgc gcacgctcgg gtcgttgggc 900 agcccgatga cagcgaccac gctcttgaag ccctgtgcct ccagggactt cagcaggtgg 960 gtgtagagcg tggagcccag tcccgtccgc tggtggcggg gggagacgta cacggtcgac 1020 tcggccgtcc agtcgtaggc gttgcgtgcc ttccaggggc ccgcgtaggc gatgccggcg 1080 acctcgccgt ccacctcggc gacgagccag ggatagcgct cccgcagacg gacgaggtcg 1140 tccgtccact cctgcggttc ctgcggctcg gtacggaagt tgaccgtgct tgtctcgatg 1200 tagtggttga cgatggtgca gaccgccggc atgtccgcct cggtggcacg gcggatgtcg 1260 gccgggcgtc gttctgggct catggatccg atttgtagag agagactggt gatttcagcg 1320 tgtcctctcc aaatgaaatg aacttcctta tatagaggaa gggtcttgcg aaggatagtg 1380 ggattgtgcg tcatccctta cgtcagtgga gatatcacat caatccactt gctttgaaga 1440 cgtggttgga acgtcttctt tttccacgat gctcctcgtg ggtgggggtc catctttggg 1500 accactgtcg gcagaggcat cttgaacgat agcctttcct ttatcgcaat gatggcattt 1560 gtaggtgcca ccttcctttt ctactgtcct tttgatgaag tgacagatag ctgggcaatg 1620 gaatccgagg aggtttcccg atattaccct ttgttgaaaa gtctcaatag ccctttggtc 1680 ttctgagact gtatctttga tattcttgga gtagacgaga gtgtcgtgct ccaccatgtt 1740 gacgaagatt ttcttcttgt cattgagtcg taaaagactc tgtatgaact gttcgccagt 1800 cttcacggcg agttctgtta gatcctcgat ctgaattttt gactccatgg cctttgattc 1860 agtaggaact actttcttag agactccaat ctctattact tgccttggtt tatgaagcaa 1920 gccttgaatc gtccatactg gaatagtact tctgatcttg agaaatatat ctttctctgt 1980 gttcttgatg cagttagtcc tgaatctttt gactgcatct ttaaccttct tgggaaggta 2040 tttgatctcc tggagattat tactcgggta gatcgtcttg atgagacctg ccgcgtaggc 2100 tagaggatcc ccggga 2116 4 25 DNA Artificial primer (1)..(25) oligonucleotide primer 4 ttcgtaactc agttgccaaa tattg 25 5 22 DNA Artificial primer (1)..(22) Oligonucleotide primer 5 ccagcccgtt tgtgtgtacg gt 22 6 21 DNA Artificial primer (1)..(21) Oligonucleotide primer 6 gttcgtatcg tcaccaattc g 21 7 21 DNA Artificial primer (1)..(21) Oligonucleotide primer 7 gtgcactatc attggttaac g 21 8 35 DNA Artificial primer (1)..(35) Oligonucleotide primer 8 aagcttgcgg ccgcgccatg gcttcmtcha ccacc 35 9 41 DNA Artificial primer (1)..(41) Oligonucleotide primer 9 aagcttctcg aggcggccgc tcaagaagtc cacccagtaa c 41 10 2034 DNA Parthenium argentatum 10 atggtggttt catccaccac caccacccct ctcattctcc atgatgaccc tgaaaacctc 60 ccagaactca ccggttctcc gacaactcgt cgtctatcca tcgcaaaagt gctttcgggg 120 atccttgttt cggttctggt tataggtgct cttgttgctt taatcaacaa ccaaacatat 180 gaatccccct cggccaccac attcgtaact cagttgccaa atattgatct gaagcgggtt 240 ccaggaaagt tggattcgag tgctgaggtt gaatggcaac gatccactta tcattttcaa 300 cccgacaaaa atttcattag cgatcctgat ggcccaatgt atcacatggg atggtatcat 360 ctattttatc agtacaaccc tcaatctgcc atctggggca acatcacatg gggccactcg 420 gtatcgaaag acatgatcaa ctggttccat ctccctttcg ccatggttcc tgaccattgg 480 tacgacatcg aaggtgtcat gacgggttcg gctacagtcc tccctaatgg tcaaatcatc 540 atgctttact cgggcaacgc gtatgatctc tcccaagtac aatgcttggc gtacgctgtc 600 aactcgtcgg atccacttct tatagagtgg aaaaaatatg aaggtaaccc tgtcttactc 660 ccaccaccag gagtaggcta caaggacttt cgggacccat ccacattgtg gtcgggccct 720 gatggtgaat atagaatggt aatggggtcc aagcacaacg agactattgg ctgtgctttg 780 atttaccata ccactaattt tacgcatttt gaattgaaag aggaggtgct tcatgcagtc 840 ccacatactg gtatgtggga atgtgttgat ctttacccgg tgtccaccgt acacacaaac 900 gggctggaca tggtggataa cgggccaaat gttaagtacg tgttgaaaca aagtggggat 960 gaagatcgcc atgattggta tgcaattgga agttacgata tagtgaatga taagtggtac 1020 ccagatgacc cggaaaatga tgtgggtatc ggattaagat atgattttgg aaaattttat 1080 gcgtccaaga cgttttatga ccaacataag aagaggagag tcctttgggg ctatgttgga 1140 gaaaccgatc cccaaaagta tgacctttca aagggatggg ctaacatttt gaatattcca 1200 aggaccgtcg ttttggacct cgaaactaaa accaatttga ttcaatggcc aatcgaggaa 1260 accgaaaacc ttaggtcgaa aaagtatgat gaatttaaag acgtcgagct tcgacccggg 1320 gcactcgttc cccttgagat aggcacagcc acacagttgg atatagttgc gacattcgaa 1380 atcgaccaaa agatgttgga atcaacgcta gaggccgatg ttctattcaa ttgcacgact 1440 agtgaaggct cggttgcaag gagtgtgttg ggaccgtttg gtgtggtggt tctagccgat 1500 gcccagcgct ccgaacaact tcctgtatac ttctatatcg caaaagatat tgatggaacc 1560 tcacgaactt atttttgtgc cgacgaaaca agatcatcca aggatgtaag cgtagggaaa 1620 tgggtgtacg gaagcagtgt tcctgtcctc ccaggcgaaa agtacaatat gaggttattg 1680 gtggatcatt cgatagtaga gggatttgca caaaacggga gaaccgtggt gacatcaaga 1740 gtgtatccaa caaaggcgat ctacaacgct gcgaaggtgt ttttgttcaa caacgcgact 1800 ggaatcagtg tgaaggcgtc gatcaagatc tggaagatgg gggaagcaga actcaatcct 1860 ttccctcttc ctgggtggac tttcgaactt tgatggttat attttggacc ctatatatgt 1920 gttattatca tgatggttat attttggacc ctatatatgt gttattatca tgaagcataa 1980 gtttggactg gagggggtat tattgtaatt ttatatgcat gttctattac ttga 2034 11 38 DNA Artificial primer (1)..(38) Oligonucleotide primer 11 aagcttgcgg ccgcaccatg gcaacccctg aacaaccc 38 12 41 DNA Artificial primer (1)..(41) Oligonucleotide primer 12 aagcttctcg aggcggccgc ctaattaaac tcgtattgat g 41 13 1862 DNA Parthenium argentatum 13 atggaaaccc ctgaaccctt tacagacctt gaacatgaac cccacacacc cctactggac 60 caccaccaca acccaccacc acaaaccacc acaaaacctt tgttcaccag ggttgtgtcc 120 ggtgtcacct ttgttttatt cttctttggt ttcgctatcg tattcattgt tctcaaccaa 180 cagaattctt ctgttcgtat cgtcaccaat tcggagaaat cttttataag gtattcgcag 240 accgatcgct tgtcgtggga acggaccgct tttcattttc agcctgccaa gaattttatt 300 tacgatccag atggtcagtt gtttcacatg ggctggtacc atatgttcta tcaatacaac 360 ccatacgcac cggtttgggg caatatgtca tggggtcact cagtgtccaa agacatgatc 420 aactggtacg agctgccagt cgctatggtc ccgaccgaat ggtatgatat cgagggcgtc 480 ttatccgggt ctaccacggt ccttccaaac ggtcagatct ttgcattgta tactgggaac 540 gctaatgatt tttcccaatt acaatgcaaa gctgtacccg taaacttatc tgacccgctt 600 cttattgagt gggtcaagta tgaggataac ccaatcctgt acactccacc agggattggg 660 ttaaaggact atcgggaccc gtcaacagtc tggacaggtc ccgatggaaa gcataggatg 720 atcatgggaa ctaaacgtgg caatacaggc atggtacttg tttactatac cactgattac 780 acgaactacg agttgttgga tgagccgttg cactctgttc ccaacaccga tatgtgggaa 840 tgcgtcgact tttacccggt ttcgttaacc aatgatagtg cacttgatat ggcggcctat 900 gggtcgggta tcaaacacgt tattaaagaa agttgggagg gacatggaat ggattggtat 960 tcaatcggga catatgacgc gataaatgat aaatggactc ccgataaccc ggaactagat 1020 gtcggtatcg ggttacggtg cgattacggg aggttttttg catcaaagag tctttatgac 1080 ccattgaaga aaaggaggat cacttggggt tatgttggag aatcagatag tgctgatcag 1140 gacctctcta gaggatgggc tactgtttat aatgttggaa gaacaattgt actagataga 1200 aagaccggga cccatttact tcattggccc gttgaggaag tcgagagttt gagatacaac 1260 ggtcaggagt ttaaagagat caagctagag cccggttcaa tcattccact cgacataggc 1320 acggctacac agttggacat agttgcaaca tttgaggtgg atcaagcagc gttgaacgcg 1380 acaagtgaaa ccgatgatat ttatggttgc accactagct taggtgcagc ccaaagggga 1440 agtttgggac catttggtct tgcggttcta gccgatggaa ccctttctga gttaactccg 1500 gtttatttct atatagctaa aaaggcagat ggaggtgtgt cgacacattt ttgtaccgat 1560 aagctaaggt catcactaga ttatgatggg gagagagtgg tgtatggggg cactgttcct 1620 gtgttagatg atgaagaact cacaatgagg ctattggtgg atcattcgat agtggagggg 1680 tttgcgcaag gaggaaggac ggttataaca tcaagggcgt atccaacaaa agcgatatac 1740 gaacaagcga agctgttctt gttcaacaac gccacaggta cgagtgtgaa agcatctctc 1800 aagatttggc aaatggcttc tgcaccaatt catcaatacc ctttttaatt accggctatc 1860 gg 1862 14 6742 DNA Expression vector pJMS02 misc_feature (5830)..(5830) n = A, C, G, or T 14 ggccgcgcca tggcttcatc taccaccacc tcccctctca ttctccacga tgatcctgaa 60 aacctccagg aacccaccgg atttacgggg gttcgtcgtc catccatcgc aaaagcgctt 120 tgcgtaaccc ttgtttcggt tatggtaatc tgtggtctgg ttgctgtaat cagcaaccag 180 acacaggtac cacaagtagc caacagccat caaggtgccg ccaccacatt cacaactcag 240 ttgccaaaaa tagatatgaa acgggttccg ggagagttgg attcgggtgc tgatgtccaa 300 tggcaacgct ccgcttatca ttttcaacct gacaaaaact acattagtga tcctgatggc 360 ccaatgtatc acatgggatg gtaccatcta ttttatcagt acaacccaga atctgccata 420 tggggcaaca tcacatgggg tcactccgta tccaaagaca tgatcaactg gttccatctc 480 cctttcgcca tggttccgga ccattggtac gacatcgaag gcgtcatgac aggttccgcc 540 acagtcctcc caaacggtga gatcatcatg ctttacacgg gcaatgcgta cgatctctcc 600 caagtacaat gcttagcgta cgcagtcaac tcatcagatc cacttcttat agagtggaaa 660 aaatacgaag gcaacccggt tttattgccg ccgccagggg tgggttacaa ggattttcgg 720 gacccatcta cattgtggct gggccccgat ggtgaatata gaatggtaat ggggtccaag 780 cacaacgaga ctattggttg tgctttgatt taccatacca ctaattttac gcattttgaa 840 ttgaatgagg aggtgcttca tgcggtccca catactggta tgtgggaatg cgttgatctt 900 tatccggtat ccaccacaca cacaaacggg ttggacatgg tggataatgg gccaaatgta 960 aaatacgtgt tgaaacaaag tggggatgaa gatcgccatg attggtatgc gattggaagt 1020 tatgattggg tgaatgataa gtggtacccg gatgacccgg aaaacgatgt gggtatcggg 1080 ttaagatacg attacggaaa gttttatgcg tccaagacgt tttatgacca acataagaaa 1140 aggagggtcc tttggggcta tgttggagaa accgatcccg aaaagtatga ccttacaaag 1200 ggatgggcta acatattgaa tattccaagg accgtcgttt tggacacgaa aactaaaacc 1260 aatttgattc aatggccaat tgaggaaacc gaaaaactta ggtcgaaaaa gtatgataaa 1320 tttgtagatg tggagcttcg acccgggtca ctcattcccc tcgagatagg tacagccaca 1380 cagttggata tagttgcgac attcgaagtt gatcaaatga tgttggaatc aacgctagaa 1440 gccgatgttc tattcaactg cacgactagt gttggctcag ttggaagggg cgtgttggga 1500 ccgtttggtg tggtggttct agctgatgcc cagcgcaccg aacaacttcc tgtgtatttc 1560 tatattgcaa aagataccga cgggacgtca agaacctact tttgtgctga tgaaacaaga 1620 tcatccaagg atgtagacgt ggggaaatgg gtgtatggaa gcagtgttcc tgtcctccct 1680 aacgaaaagt acaatatgag gttactggtg gatcattcga tagtggaggg atttgcacaa 1740 aacggaagaa cggtggtgac atcgagagtg tatccaacga aggcaattta caacgctgcg 1800 aaggtgtttt tgttcaacaa cgcgaccggg attagggtga aggcgtcggt caagatttgg 1860 aagatggcgg aagcagaact caaccctttc ccagttactg ggtggacttc ttgagcggcc 1920 gcgacacaag tgtgagagta ctaaataaat gctttggttg tacgaaatca ttacactaaa 1980 taaaataatc aaagcttata tatgccttcc gctaaggccg aatgcaaaga aattggttct 2040 ttctcgttat cttttgccac ttttactagt acgtattaat tactacttaa tcatctttgt 2100 ttacggctca ttatatccgt cgacggcgcg cccgatcatc cggatatagt tcctcctttc 2160 agcaaaaaac ccctcaagac ccgtttagag gccccaaggg gttatgctag ttattgctca 2220 gcggtggcag cagccaactc agcttccttt cgggctttgt tagcagccgg atcgatccaa 2280 gctgtacctc actattcctt tgccctcgga cgagtgctgg ggcgtcggtt tccactatcg 2340 gcgagtactt ctacacagcc atcggtccag acggccgcgc ttctgcgggc gatttgtgta 2400 cgcccgacag tcccggctcc ggatcggacg attgcgtcgc atcgaccctg cgcccaagct 2460 gcatcatcga aattgccgtc aaccaagctc tgatagagtt ggtcaagacc aatgcggagc 2520 atatacgccc ggagccgcgg cgatcctgca agctccggat gcctccgctc gaagtagcgc 2580 gtctgctgct ccatacaagc caaccacggc ctccagaaga agatgttggc gacctcgtat 2640 tgggaatccc cgaacatcgc ctcgctccag tcaatgaccg ctgttatgcg gccattgtcc 2700 gtcaggacat tgttggagcc gaaatccgcg tgcacgaggt gccggacttc ggggcagtcc 2760 tcggcccaaa gcatcagctc atcgagagcc tgcgcgacgg acgcactgac ggtgtcgtcc 2820 atcacagttt gccagtgata cacatgggga tcagcaatcg cgcatatgaa atcacgccat 2880 gtagtgtatt gaccgattcc ttgcggtccg aatgggccga acccgctcgt ctggctaaga 2940 tcggccgcag cgatcgcatc catagcctcc gcgaccggct gcagaacagc gggcagttcg 3000 gtttcaggca ggtcttgcaa cgtgacaccc tgtgcacggc gggagatgca ataggtcagg 3060 ctctcgctga attccccaat gtcaagcact tccggaatcg ggagcgcggc cgatgcaaag 3120 tgccgataaa cataacgatc tttgtagaaa ccatcggcgc agctatttac ccgcaggaca 3180 tatccacgcc ctcctacatc gaagctgaaa gcacgagatt cttcgccctc cgagagctgc 3240 atcaggtcgg agacgctgtc gaacttttcg atcagaaact tctcgacaga cgtcgcggtg 3300 agttcaggct tttccatggg tatatctcct tcttaaagtt aaacaaaatt atttctagag 3360 ggaaaccgtt gtggtctccc tatagtgagt cgtattaatt tcgcgggatc gagatctgat 3420 caacctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc 3480 ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc 3540 agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa 3600 catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 3660 tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 3720 gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 3780 ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 3840 cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 3900 caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 3960 ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 4020 taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 4080 taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 4140 cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 4200 tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 4260 gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 4320 catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg 4380 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4440 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4500 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatggacat 4560 attgtcgtta gaacgcggct acaattaata cataacctta tgtatcatac acatacgatt 4620 taggtgacac tatagaacgg cgcgccaagc ttggatcctc gaagagaagg gttaataaca 4680 cattttttaa catttttaac acaaatttta gttatttaaa aatttattaa aaaatttaaa 4740 ataagaagag gaactcttta aataaatcta acttacaaaa tttatgattt ttaataagtt 4800 ttcaccaata aaaaatgtca taaaaatatg ttaaaaagta tattatcaat attctcttta 4860 tgataaataa aaagaaaaaa aaaataaaag ttaagtgaaa atgagattga agtgacttta 4920 ggtgtgtata aatatatcaa ccccgccaac aatttattta atccaaatat attgaagtat 4980 attattccat agcctttatt tatttatata tttattatat aaaagcttta tttgttctag 5040 gttgttcatg aaatattttt ttggttttat ctccgttgta agaaaatcat gtgctttgtg 5100 tcgccactca ctattgcagc tttttcatgc attggtcaga ttgacggttg attgtatttt 5160 tgttttttat ggttttgtgt tatgacttaa gtcttcatct ctttatctct tcatcaggtt 5220 tgatggttac ctaatatggt ccatgggtac atgcatggtt aaattaggtg gccaactttg 5280 ttgtgaacga tagaattttt tttatattaa gtaaactatt tttatattat gaaataataa 5340 taaaaaaaat attttatcat tattaacaaa atcatattag ttaatttgtt aactctataa 5400 taaaagaaat actgtaacat tcacattaca tggtaacatc tttccaccct ttcatttgtt 5460 ttttgtttga tgactttttt tcttgtttaa atttatttcc cttcttttaa atttggaata 5520 cattatcatc atatataaac taaaatacta aaaacaggat tacacaaatg ataaataata 5580 acacaaatat ttataaatct agctgcaata tatttaaact agctatatcg atattgtaaa 5640 ataaaactag ctgcattgat actgataaaa aaatatcatg tgctttctgg actgatgatg 5700 cagtatactt ttgacattgc ctttatttta tttttcagaa aagctttctt agttctgggt 5760 tcttcattat ttgtttccca tctccattgt gaattgaatc atttgcttcg tgtcacaaat 5820 acaatttagn taggtacatg cattggtcag attcacggtt tattatgtca tgacttaagt 5880 tcatggtagt acattacctg ccacgcatgc attatattgg ttagatttga taggcaaatt 5940 tggttgtcaa caatataaat ataaataatg tttttatatt acgaaataac agtgatcaaa 6000 acaaacagtt ttatctttat taacaagatt ttgtttttgt ttgatgacgt tttttaatgt 6060 ttacgctttc ccccttcttt tgaatttaga acactttatc atcataaaat caaatactaa 6120 aaaaattaca tatttcataa ataataacac aaatattttt aaaaaatctg aaataataat 6180 gaacaatatt acatattatc acgaaaattc attaataaaa atattatata aataaaatgt 6240 aatagtagtt atatgtagga aaaaagtact gcacgcataa tatatacaaa aagattaaaa 6300 tgaactatta taaataataa cactaaatta atggtgaatc atatcaaaat aatgaaaaag 6360 taaataaaat ttgtaattaa cttctatatg tattacacac acaaataata aataatagta 6420 aaaaaaatta tgataaatat ttaccatctc ataagatatt taaaataatg ataaaaatat 6480 agattatttt ttatgcaact agctagccaa aaagagaaca cgggtatata taaaaagagt 6540 acctttaaat tctactgtac ttcctttatt cctgacgttt ttatatcaag tggacatacg 6600 tgaagatttt aattatcagt ctaaatattt cattagcact taatactttt ctgttttatt 6660 cctatcctat aagtagtccc gattctccca acattgctta ttcacacaac taactaagaa 6720 agtcttccat agccccccaa gc 6742 15 6667 DNA Expression vector pRM03 misc_feature (3914)..(3914) n = A, C, G, or T 15 ggccgcgaca caagtgtgag agtactaaat aaatgctttg gttgtacgaa atcattacac 60 taaataaaat aatcaaagct tatatatgcc ttccgctaag gccgaatgca aagaaattgg 120 ttctttctcg ttatcttttg ccacttttac tagtacgtat taattactac ttaatcatct 180 ttgtttacgg ctcattatat ccgtcgacgg cgcgcccgat catccggata tagttcctcc 240 tttcagcaaa aaacccctca agacccgttt agaggcccca aggggttatg ctagttattg 300 ctcagcggtg gcagcagcca actcagcttc ctttcgggct ttgttagcag ccggatcgat 360 ccaagctgta cctcactatt cctttgccct cggacgagtg ctggggcgtc ggtttccact 420 atcggcgagt acttctacac agccatcggt ccagacggcc gcgcttctgc gggcgatttg 480 tgtacgcccg acagtcccgg ctccggatcg gacgattgcg tcgcatcgac cctgcgccca 540 agctgcatca tcgaaattgc cgtcaaccaa gctctgatag agttggtcaa gaccaatgcg 600 gagcatatac gcccggagcc gcggcgatcc tgcaagctcc ggatgcctcc gctcgaagta 660 gcgcgtctgc tgctccatac aagccaacca cggcctccag aagaagatgt tggcgacctc 720 gtattgggaa tccccgaaca tcgcctcgct ccagtcaatg accgctgtta tgcggccatt 780 gtccgtcagg acattgttgg agccgaaatc cgcgtgcacg aggtgccgga cttcggggca 840 gtcctcggcc caaagcatca gctcatcgag agcctgcgcg acggacgcac tgacggtgtc 900 gtccatcaca gtttgccagt gatacacatg gggatcagca atcgcgcata tgaaatcacg 960 ccatgtagtg tattgaccga ttccttgcgg tccgaatggg ccgaacccgc tcgtctggct 1020 aagatcggcc gcagcgatcg catccatagc ctccgcgacc ggctgcagaa cagcgggcag 1080 ttcggtttca ggcaggtctt gcaacgtgac accctgtgca cggcgggaga tgcaataggt 1140 caggctctcg ctgaattccc caatgtcaag cacttccgga atcgggagcg cggccgatgc 1200 aaagtgccga taaacataac gatctttgta gaaaccatcg gcgcagctat ttacccgcag 1260 gacatatcca cgccctccta catcgaagct gaaagcacga gattcttcgc cctccgagag 1320 ctgcatcagg tcggagacgc tgtcgaactt ttcgatcaga aacttctcga cagacgtcgc 1380 ggtgagttca ggcttttcca tgggtatatc tccttcttaa agttaaacaa aattatttct 1440 agagggaaac cgttgtggtc tccctatagt gagtcgtatt aatttcgcgg gatcgagatc 1500 tgatcaacct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 1560 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 1620 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 1680 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 1740 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 1800 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 1860 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 1920 gaagcgtggc gctttctcaa tgctcacgct gtaggtatct cagttcggtg taggtcgttc 1980 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 2040 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 2100 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 2160 ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 2220 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 2280 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 2340 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 2400 tggtcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt ctcgcgcgtt 2460 tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc 2520 tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt gttggcgggt 2580 gtcggggctg gcttaactat gcggcatcag agcagattgt actgagagtg caccatatgg 2640 acatattgtc gttagaacgc ggctacaatt aatacataac cttatgtatc atacacatac 2700 gatttaggtg acactataga acggcgcgcc aagcttggat cctcgaagag aagggttaat 2760 aacacatttt ttaacatttt taacacaaat tttagttatt taaaaattta ttaaaaaatt 2820 taaaataaga agaggaactc tttaaataaa tctaacttac aaaatttatg atttttaata 2880 agttttcacc aataaaaaat gtcataaaaa tatgttaaaa agtatattat caatattctc 2940 tttatgataa ataaaaagaa aaaaaaaata aaagttaagt gaaaatgaga ttgaagtgac 3000 tttaggtgtg tataaatata tcaaccccgc caacaattta tttaatccaa atatattgaa 3060 gtatattatt ccatagcctt tatttattta tatatttatt atataaaagc tttatttgtt 3120 ctaggttgtt catgaaatat ttttttggtt ttatctccgt tgtaagaaaa tcatgtgctt 3180 tgtgtcgcca ctcactattg cagctttttc atgcattggt cagattgacg gttgattgta 3240 tttttgtttt ttatggtttt gtgttatgac ttaagtcttc atctctttat ctcttcatca 3300 ggtttgatgg ttacctaata tggtccatgg gtacatgcat ggttaaatta ggtggccaac 3360 tttgttgtga acgatagaat tttttttata ttaagtaaac tatttttata ttatgaaata 3420 ataataaaaa aaatatttta tcattattaa caaaatcata ttagttaatt tgttaactct 3480 ataataaaag aaatactgta acattcacat tacatggtaa catctttcca ccctttcatt 3540 tgttttttgt ttgatgactt tttttcttgt ttaaatttat ttcccttctt ttaaatttgg 3600 aatacattat catcatatat aaactaaaat actaaaaaca ggattacaca aatgataaat 3660 aataacacaa atatttataa atctagctgc aatatattta aactagctat atcgatattg 3720 taaaataaaa ctagctgcat tgatactgat aaaaaaatat catgtgcttt ctggactgat 3780 gatgcagtat acttttgaca ttgcctttat tttatttttc agaaaagctt tcttagttct 3840 gggttcttca ttatttgttt cccatctcca ttgtgaattg aatcatttgc ttcgtgtcac 3900 aaatacaatt tagntaggta catgcattgg tcagattcac ggtttattat gtcatgactt 3960 aagttcatgg tagtacatta cctgccacgc atgcattata ttggttagat ttgataggca 4020 aatttggttg tcaacaatat aaatataaat aatgttttta tattacgaaa taacagtgat 4080 caaaacaaac agttttatct ttattaacaa gattttgttt ttgtttgatg acgtttttta 4140 atgtttacgc tttccccctt cttttgaatt tagaacactt tatcatcata aaatcaaata 4200 ctaaaaaaat tacatatttc ataaataata acacaaatat ttttaaaaaa tctgaaataa 4260 taatgaacaa tattacatat tatcacgaaa attcattaat aaaaatatta tataaataaa 4320 atgtaatagt agttatatgt aggaaaaaag tactgcacgc ataatatata caaaaagatt 4380 aaaatgaact attataaata ataacactaa attaatggtg aatcatatca aaataatgaa 4440 aaagtaaata aaatttgtaa ttaacttcta tatgtattac acacacaaat aataaataat 4500 agtaaaaaaa attatgataa atatttacca tctcataaga tatttaaaat aatgataaaa 4560 atatagatta ttttttatgc aactagctag ccaaaaagag aacacgggta tatataaaaa 4620 gagtaccttt aaattctact gtacttcctt tattcctgac gtttttatat caagtggaca 4680 tacgtgaaga ttttaattat cagtctaaat atttcattag cacttaatac ttttctgttt 4740 tattcctatc ctataagtag tcccgattct cccaacattg cttattcaca caactaacta 4800 agaaagtctt ccatagcccc ccaagcggcc gcaccatggc aacccctgaa caacccatta 4860 cagaccttga acacgaaccc aaccacaacc gcacacccct attggaccac aacgaatcac 4920 aacccgtaaa gaaacatttg ttcttcaaag ttctgtctgg tgttaccttc atttcattgt 4980 tctttatttc tgctttttta ttcattgttt tgaaccaaca aaattctacc aatatatcgg 5040 ttaagtactc gcaatccgat cgccttacgt gggaacgaac cgcttttcat tttcaaccgg 5100 ccaagaattt tatttatgat cccaatggtc aaatgtacta catgggctgg taccatctat 5160 tctatcaata caatccatac gcaccggttt ggggtaatat gtcatggggt cactccgtat 5220 ccaaagacat gatcaactgg tacgagctac ccgtcgctat agtcccgact gaatggtatg 5280 atattgaggg cgtcttatct gggtccatca cagtgcttcc caacgggcag atctttgcat 5340 tgtacacggg gaatgctaat gacttttccc aattgcaatg caaagctgta cccgtgaact 5400 catctgaccc acttcttgtt gagtgggtca agtacgaaga taacccaatc ctgtacactc 5460 caccagggat tgggttaaaa gactataggg acccgtcaac agtctggacg ggtcctgatg 5520 gaaagcatag gatgatcatg ggaactaaac gtggcaatac aggaatgata cttgtttacc 5580 ataccactga ttacacgaac tatgagatgt tgaatgagcc tatgcactcg gttcccaata 5640 ccgatatgtg ggaatgcgtt gacttttacc cggtttcatt aaccaacgat agtgcacttg 5700 atattgcggc ctacgggtcg ggtatcaaac acgtgattaa agaaagttgg gagggatatg 5760 ggatggattt ctattcaatc gggacttatg acgcatttaa cgataaatgg actcccgata 5820 acccagagtt agatgttggt atcgggttgc ggtgtgatta cggtaggttt tttgcatcaa 5880 agagtatttt tgacccagtg aagaaaagga ggatcacttg ggcttatgtt ggagaatcag 5940 ataatgctga tgatgacctc tccagaggat gggctactat ttataatgtt ggaagaacta 6000 ttgtactaga tagaaagacc gggacccatt tacttcattg gcctgtcgag gaaatcgaga 6060 gtttgagata caatggtcag gaatttaaag agatcaaact agagcccggt tcaattgctc 6120 cactcgacat aggcaccgct acacagttgg acatagttgc aacatttaag gtggatgagg 6180 ctgcattgaa cgcgacaagt gaaaccgatg ataacttcgc ttgcaccacg agctcaggtg 6240 cagttgaaag gggaagtttg ggaccatttg gtcttgcggt tctagctgat ggaacccttt 6300 ccgagttaac tccggtttat ttctacattg ctaaaaaggc cgatggaggt gtgtcaacac 6360 atttttgtac cgataagcta aggtcatcct tggattttga taaggagaga gtggtgtacg 6420 gtagcactgt tcctgtgtta gatgatgaag aactcacaat gaggctattg gtggatcatt 6480 cggtagtcga ggcgtttgca caaggaggaa ggattgccat aacatcaagg gtgtatccga 6540 cgaaagcaat atacgaagga gcgaagttgt tcttattcaa caatgccacg gatacgagtg 6600 tgaaggcatc tctcaagatt tggcaaatgg cttctgccca aattcatcaa tacgagttta 6660 attaggc 6667 16 34 DNA Artificial linker (1)..(34) linker 16 ggcgcgccaa gcttggatcc gtcgacggcg cgcc 34 17 8890 DNA Expression vector pJMS01 17 ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag catggaatat 60 tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc tatgaataaa 120 caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct atgataaatt 180 tcctcttatt attataaatc atctgaatcg tgacggctta tggaatgctt caaatagtac 240 aaaaacaaat gtgtactata agactttcta aacaattcta accttagcat tgtgaacgag 300 acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc atttatatat 360 tatatattac ccacttatgt attatattag gatgttaagg agacataaca attataaaga 420 gagaagtttg tatccattta tatattatat actacccatt tatatattat acttatccac 480 ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag ttgatatgta 540 tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca tccttaaagt 600 gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt tggtttgata 660 aaatattgaa ggatttaaaa taataataaa taacatataa tatatgtata taaatttatt 720 ataatataac atttatctat aaaaaagtaa atattgtcat aaatctatac aatcgtttag 780 ccttgctgga cgaatctcaa ttatttaaac gagagtaaac atatttgact ttttggttat 840 ttaacaaatt attatttaac actatatgaa attttttttt ttatcagcaa agaataaaat 900 taaattaaga aggacaatgg tgtcccaatc cttatacaac caacttccac aagaaagtca 960 agtcagagac aacaaaaaaa caagcaaagg aaatttttta atttgagttg tcttgtttgc 1020 tgcataattt atgcagtaaa acactacaca taaccctttt agcagtagag caatggttga 1080 ccgtgtgctt agcttctttt attttatttt tttatcagca aagaataaat aaaataaaat 1140 gagacacttc agggatgttt caacaagctt ggatccgtcg acggcgcgcc cgatcatccg 1200 gatatagttc ctcctttcag caaaaaaccc ctcaagaccc gtttagaggc cccaaggggt 1260 tatgctagtt attgctcagc ggtggcagca gccaactcag cttcctttcg ggctttgtta 1320 gcagccggat cgatccaagc tgtacctcac tattcctttg ccctcggacg agtgctgggg 1380 cgtcggtttc cactatcggc gagtacttct acacagccat cggtccagac ggccgcgctt 1440 ctgcgggcga tttgtgtacg cccgacagtc ccggctccgg atcggacgat tgcgtcgcat 1500 cgaccctgcg cccaagctgc atcatcgaaa ttgccgtcaa ccaagctctg atagagttgg 1560 tcaagaccaa tgcggagcat atacgcccgg agccgcggcg atcctgcaag ctccggatgc 1620 ctccgctcga agtagcgcgt ctgctgctcc atacaagcca accacggcct ccagaagaag 1680 atgttggcga cctcgtattg ggaatccccg aacatcgcct cgctccagtc aatgaccgct 1740 gttatgcggc cattgtccgt caggacattg ttggagccga aatccgcgtg cacgaggtgc 1800 cggacttcgg ggcagtcctc ggcccaaagc atcagctcat cgagagcctg cgcgacggac 1860 gcactgacgg tgtcgtccat cacagtttgc cagtgataca catggggatc agcaatcgcg 1920 catatgaaat cacgccatgt agtgtattga ccgattcctt gcggtccgaa tgggccgaac 1980 ccgctcgtct ggctaagatc ggccgcagcg atcgcatcca tagcctccgc gaccggctgc 2040 agaacagcgg gcagttcggt ttcaggcagg tcttgcaacg tgacaccctg tgcacggcgg 2100 gagatgcaat aggtcaggct ctcgctgaat tccccaatgt caagcacttc cggaatcggg 2160 agcgcggccg atgcaaagtg ccgataaaca taacgatctt tgtagaaacc atcggcgcag 2220 ctatttaccc gcaggacata tccacgccct cctacatcga agctgaaagc acgagattct 2280 tcgccctccg agagctgcat caggtcggag acgctgtcga acttttcgat cagaaacttc 2340 tcgacagacg tcgcggtgag ttcaggcttt tccatgggta tatctccttc ttaaagttaa 2400 acaaaattat ttctagaggg aaaccgttgt ggtctcccta tagtgagtcg tattaatttc 2460 gcgggatcga gatcgatcca attccaatcc cacaaaaatc tgagcttaac agcacagttg 2520 ctcctctcag agcagaatcg ggtattcaac accctcatat caactactac gttgtgtata 2580 acggtccaca tgccggtata tacgatgact ggggttgtac aaaggcggca acaaacggcg 2640 ttcccggagt tgcacacaag aaatttgcca ctattacaga ggcaagagca gcagctgacg 2700 cgtacacaac aagtcagcaa acagacaggt tgaacttcat ccccaaagga gaagctcaac 2760 tcaagcccaa gagctttgct aaggccctaa caagcccacc aaagcaaaaa gcccactggc 2820 tcacgctagg aaccaaaagg cccagcagtg atccagcccc aaaagagatc tcctttgccc 2880 cggagattac aatggacgat ttcctctatc tttacgatct aggaaggaag ttcgaaggtg 2940 aaggtgacga cactatgttc accactgata atgagaaggt tagcctcttc aatttcagaa 3000 agaatgctga cccacagatg gttagagagg cctacgcagc aggtctcatc aagacgatct 3060 acccgagtaa caatctccag gagatcaaat accttcccaa gaaggttaaa gatgcagtca 3120 aaagattcag gactaattgc atcaagaaca cagagaaaga catatttctc aagatcagaa 3180 gtactattcc agtatggacg attcaaggct tgcttcataa accaaggcaa gtaatagaga 3240 ttggagtctc taaaaaggta gttcctactg aatctaaggc catgcatgga gtctaagatt 3300 caaatcgagg atctaacaga actcgccgtg aagactggcg aacagttcat acagagtctt 3360 ttacgactca atgacaagaa gaaaatcttc gtcaacatgg tggagcacga cactctggtc 3420 tactccaaaa atgtcaaaga tacagtctca gaagaccaaa gggctattga gacttttcaa 3480 caaaggataa tttcgggaaa cctcctcgga ttccattgcc cagctatctg tcacttcatc 3540 gaaaggacag tagaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 3600 gctatcattc aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg 3660 agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgac 3720 atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 3780 atataaggaa gttcatttca tttggagagg acacgctcga gctcatttct ctattacttc 3840 agccataaca aaagaactct tttctcttct tattaaacca tgaaaaagcc tgaactcacc 3900 gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 3960 ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 4020 ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 4080 gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaattcag cgagagcctg 4140 acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 4200 ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 4260 agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 4320 cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 4380 gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 4440 tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 4500 aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 4560 gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 4620 tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 4680 cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 4740 tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 4800 caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 4860 agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagtgagg tacctaaaga 4920 aggagtgcgt cgaagcagat cgttcaaaca tttggcaata aagtttctta agattgaatc 4980 ctgttgccgg tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa 5040 taattaacat gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc 5100 aattatacat ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat 5160 cgcgcgcggt gtcatctatg ttactagatc gatgtcgaat ctgatcaacc tgcattaatg 5220 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 5280 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 5340 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 5400 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 5460 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 5520 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 5580 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 5640 atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 5700 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 5760 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 5820 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 5880 tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 5940 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 6000 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 6060 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga cattaaccta 6120 taaaaatagg cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 6180 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 6240 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 6300 tgcggcatca gagcagattg tactgagagt gcaccatatg gacatattgt cgttagaacg 6360 cggctacaat taatacataa ccttatgtat catacacata cgatttaggt gacactatag 6420 aacggcgcgc caagcttttg atccatgccc ttcatttgcc gcttattaat taatttggta 6480 acagtccgta ctaatcagtt acttatcctt cccccatcat aattaatctt ggtagtctcg 6540 aatgccacaa cactgactag tctcttggat cataagaaaa agccaaggaa caaaagaaga 6600 caaaacacaa tgagagtatc ctttgcatag caatgtctaa gttcataaaa ttcaaacaaa 6660 aacgcaatca cacacagtgg acatcactta tccactagct gatcaggatc gccgcgtcaa 6720 gaaaaaaaaa ctggacccca aaagccatgc acaacaacac gtactcacaa aggtgtcaat 6780 cgagcagccc aaaacattca ccaactcaac ccatcatgag ccctcacatt tgttgtttct 6840 aacccaacct caaactcgta ttctcttccg ccacctcatt tttgtttatt tcaacacccg 6900 tcaaactgca tgccaccccg tggccaaatg tccatgcatg ttaacaagac ctatgactat 6960 aaatagctgc aatctcggcc caggttttca tcatcaagaa ccagttcaat atcctagtac 7020 accgtattaa agaatttaag atatactgcg gccgcaccat ggcaacccct gaacaaccca 7080 ttacagacct tgaacacgaa cccaaccaca accgcacacc cctattggac cacaacgaat 7140 cacaacccgt aaagaaacat ttgttcttca aagttctgtc tggtgttacc ttcatttcat 7200 tgttctttat ttctgctttt ttattcattg ttttgaacca acaaaattct accaatatat 7260 cggttaagta ctcgcaatcc gatcgcctta cgtgggaacg aaccgctttt cattttcaac 7320 cggccaagaa ttttatttat gatcccaatg gtcaaatgta ctacatgggc tggtaccatc 7380 tattctatca atacaatcca tacgcaccgg tttggggtaa tatgtcatgg ggtcactccg 7440 tatccaaaga catgatcaac tggtacgagc tacccgtcgc tatagtcccg actgaatggt 7500 atgatattga gggcgtctta tctgggtcca tcacagtgct tcccaacggg cagatctttg 7560 cattgtacac ggggaatgct aatgactttt cccaattgca atgcaaagct gtacccgtga 7620 actcatctga cccacttctt gttgagtggg tcaagtacga agataaccca atcctgtaca 7680 ctccaccagg gattgggtta aaagactata gggacccgtc aacagtctgg acgggtcctg 7740 atggaaagca taggatgatc atgggaacta aacgtggcaa tacaggaatg atacttgttt 7800 accataccac tgattacacg aactatgaga tgttgaatga gcctatgcac tcggttccca 7860 ataccgatat gtgggaatgc gttgactttt acccggtttc attaaccaac gatagtgcac 7920 ttgatattgc ggcctacggg tcgggtatca aacacgtgat taaagaaagt tgggagggat 7980 atgggatgga tttctattca atcgggactt atgacgcatt taacgataaa tggactcccg 8040 ataacccaga gttagatgtt ggtatcgggt tgcggtgtga ttacggtagg ttttttgcat 8100 caaagagtat ttttgaccca gtgaagaaaa ggaggatcac ttgggcttat gttggagaat 8160 cagataatgc tgatgatgac ctctccagag gatgggctac tatttataat gttggaagaa 8220 ctattgtact agatagaaag accgggaccc atttacttca ttggcctgtc gaggaaatcg 8280 agagtttgag atacaatggt caggaattta aagagatcaa actagagccc ggttcaattg 8340 ctccactcga cataggcacc gctacacagt tggacatagt tgcaacattt aaggtggatg 8400 aggctgcatt gaacgcgaca agtgaaaccg atgataactt cgcttgcacc acgagctcag 8460 gtgcagttga aaggggaagt ttgggaccat ttggtcttgc ggttctagct gatggaaccc 8520 tttccgagtt aactccggtt tatttctaca ttgctaaaaa ggccgatgga ggtgtgtcaa 8580 cacatttttg taccgataag ctaaggtcat ccttggattt tgataaggag agagtggtgt 8640 acggtagcac tgttcctgtg ttagatgatg aagaactcac aatgaggcta ttggtggatc 8700 attcggtagt cgaggcgttt gcacaaggag gaaggattgc cataacatca agggtgtatc 8760 cgacgaaagc aatatacgaa ggagcgaagt tgttcttatt caacaatgcc acggatacga 8820 gtgtgaaggc atctctcaag atttggcaaa tggcttctgc ccaaattcat caatacgagt 8880 ttaattaggc 8890 18 8965 DNA Expression vector pRM02 18 ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag catggaatat 60 tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc tatgaataaa 120 caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct atgataaatt 180 tcctcttatt attataaatc atctgaatcg tgacggctta tggaatgctt caaatagtac 240 aaaaacaaat gtgtactata agactttcta aacaattcta accttagcat tgtgaacgag 300 acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc atttatatat 360 tatatattac ccacttatgt attatattag gatgttaagg agacataaca attataaaga 420 gagaagtttg tatccattta tatattatat actacccatt tatatattat acttatccac 480 ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag ttgatatgta 540 tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca tccttaaagt 600 gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt tggtttgata 660 aaatattgaa ggatttaaaa taataataaa taacatataa tatatgtata taaatttatt 720 ataatataac atttatctat aaaaaagtaa atattgtcat aaatctatac aatcgtttag 780 ccttgctgga cgaatctcaa ttatttaaac gagagtaaac atatttgact ttttggttat 840 ttaacaaatt attatttaac actatatgaa attttttttt ttatcagcaa agaataaaat 900 taaattaaga aggacaatgg tgtcccaatc cttatacaac caacttccac aagaaagtca 960 agtcagagac aacaaaaaaa caagcaaagg aaatttttta atttgagttg tcttgtttgc 1020 tgcataattt atgcagtaaa acactacaca taaccctttt agcagtagag caatggttga 1080 ccgtgtgctt agcttctttt attttatttt tttatcagca aagaataaat aaaataaaat 1140 gagacacttc agggatgttt caacaagctt ggatccgtcg acggcgcgcc cgatcatccg 1200 gatatagttc ctcctttcag caaaaaaccc ctcaagaccc gtttagaggc cccaaggggt 1260 tatgctagtt attgctcagc ggtggcagca gccaactcag cttcctttcg ggctttgtta 1320 gcagccggat cgatccaagc tgtacctcac tattcctttg ccctcggacg agtgctgggg 1380 cgtcggtttc cactatcggc gagtacttct acacagccat cggtccagac ggccgcgctt 1440 ctgcgggcga tttgtgtacg cccgacagtc ccggctccgg atcggacgat tgcgtcgcat 1500 cgaccctgcg cccaagctgc atcatcgaaa ttgccgtcaa ccaagctctg atagagttgg 1560 tcaagaccaa tgcggagcat atacgcccgg agccgcggcg atcctgcaag ctccggatgc 1620 ctccgctcga agtagcgcgt ctgctgctcc atacaagcca accacggcct ccagaagaag 1680 atgttggcga cctcgtattg ggaatccccg aacatcgcct cgctccagtc aatgaccgct 1740 gttatgcggc cattgtccgt caggacattg ttggagccga aatccgcgtg cacgaggtgc 1800 cggacttcgg ggcagtcctc ggcccaaagc atcagctcat cgagagcctg cgcgacggac 1860 gcactgacgg tgtcgtccat cacagtttgc cagtgataca catggggatc agcaatcgcg 1920 catatgaaat cacgccatgt agtgtattga ccgattcctt gcggtccgaa tgggccgaac 1980 ccgctcgtct ggctaagatc ggccgcagcg atcgcatcca tagcctccgc gaccggctgc 2040 agaacagcgg gcagttcggt ttcaggcagg tcttgcaacg tgacaccctg tgcacggcgg 2100 gagatgcaat aggtcaggct ctcgctgaat tccccaatgt caagcacttc cggaatcggg 2160 agcgcggccg atgcaaagtg ccgataaaca taacgatctt tgtagaaacc atcggcgcag 2220 ctatttaccc gcaggacata tccacgccct cctacatcga agctgaaagc acgagattct 2280 tcgccctccg agagctgcat caggtcggag acgctgtcga acttttcgat cagaaacttc 2340 tcgacagacg tcgcggtgag ttcaggcttt tccatgggta tatctccttc ttaaagttaa 2400 acaaaattat ttctagaggg aaaccgttgt ggtctcccta tagtgagtcg tattaatttc 2460 gcgggatcga gatcgatcca attccaatcc cacaaaaatc tgagcttaac agcacagttg 2520 ctcctctcag agcagaatcg ggtattcaac accctcatat caactactac gttgtgtata 2580 acggtccaca tgccggtata tacgatgact ggggttgtac aaaggcggca acaaacggcg 2640 ttcccggagt tgcacacaag aaatttgcca ctattacaga ggcaagagca gcagctgacg 2700 cgtacacaac aagtcagcaa acagacaggt tgaacttcat ccccaaagga gaagctcaac 2760 tcaagcccaa gagctttgct aaggccctaa caagcccacc aaagcaaaaa gcccactggc 2820 tcacgctagg aaccaaaagg cccagcagtg atccagcccc aaaagagatc tcctttgccc 2880 cggagattac aatggacgat ttcctctatc tttacgatct aggaaggaag ttcgaaggtg 2940 aaggtgacga cactatgttc accactgata atgagaaggt tagcctcttc aatttcagaa 3000 agaatgctga cccacagatg gttagagagg cctacgcagc aggtctcatc aagacgatct 3060 acccgagtaa caatctccag gagatcaaat accttcccaa gaaggttaaa gatgcagtca 3120 aaagattcag gactaattgc atcaagaaca cagagaaaga catatttctc aagatcagaa 3180 gtactattcc agtatggacg attcaaggct tgcttcataa accaaggcaa gtaatagaga 3240 ttggagtctc taaaaaggta gttcctactg aatctaaggc catgcatgga gtctaagatt 3300 caaatcgagg atctaacaga actcgccgtg aagactggcg aacagttcat acagagtctt 3360 ttacgactca atgacaagaa gaaaatcttc gtcaacatgg tggagcacga cactctggtc 3420 tactccaaaa atgtcaaaga tacagtctca gaagaccaaa gggctattga gacttttcaa 3480 caaaggataa tttcgggaaa cctcctcgga ttccattgcc cagctatctg tcacttcatc 3540 gaaaggacag tagaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 3600 gctatcattc aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg 3660 agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgac 3720 atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 3780 atataaggaa gttcatttca tttggagagg acacgctcga gctcatttct ctattacttc 3840 agccataaca aaagaactct tttctcttct tattaaacca tgaaaaagcc tgaactcacc 3900 gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 3960 ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 4020 ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 4080 gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaattcag cgagagcctg 4140 acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 4200 ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 4260 agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 4320 cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 4380 gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 4440 tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 4500 aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 4560 gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 4620 tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 4680 cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 4740 tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 4800 caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 4860 agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagtgagg tacctaaaga 4920 aggagtgcgt cgaagcagat cgttcaaaca tttggcaata aagtttctta agattgaatc 4980 ctgttgccgg tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa 5040 taattaacat gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc 5100 aattatacat ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat 5160 cgcgcgcggt gtcatctatg ttactagatc gatgtcgaat ctgatcaacc tgcattaatg 5220 aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 5280 cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 5340 ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 5400 ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 5460 cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 5520 actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 5580 cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 5640 atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 5700 gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 5760 caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 5820 agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 5880 tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 5940 tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 6000 gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 6060 gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga cattaaccta 6120 taaaaatagg cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 6180 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 6240 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 6300 tgcggcatca gagcagattg tactgagagt gcaccatatg gacatattgt cgttagaacg 6360 cggctacaat taatacataa ccttatgtat catacacata cgatttaggt gacactatag 6420 aacggcgcgc caagcttttg atccatgccc ttcatttgcc gcttattaat taatttggta 6480 acagtccgta ctaatcagtt acttatcctt cccccatcat aattaatctt ggtagtctcg 6540 aatgccacaa cactgactag tctcttggat cataagaaaa agccaaggaa caaaagaaga 6600 caaaacacaa tgagagtatc ctttgcatag caatgtctaa gttcataaaa ttcaaacaaa 6660 aacgcaatca cacacagtgg acatcactta tccactagct gatcaggatc gccgcgtcaa 6720 gaaaaaaaaa ctggacccca aaagccatgc acaacaacac gtactcacaa aggtgtcaat 6780 cgagcagccc aaaacattca ccaactcaac ccatcatgag ccctcacatt tgttgtttct 6840 aacccaacct caaactcgta ttctcttccg ccacctcatt tttgtttatt tcaacacccg 6900 tcaaactgca tgccaccccg tggccaaatg tccatgcatg ttaacaagac ctatgactat 6960 aaatagctgc aatctcggcc caggttttca tcatcaagaa ccagttcaat atcctagtac 7020 accgtattaa agaatttaag atatactgcg gccgcgccat ggcttcatct accaccacct 7080 cccctctcat tctccacgat gatcctgaaa acctccagga acccaccgga tttacggggg 7140 ttcgtcgtcc atccatcgca aaagcgcttt gcgtaaccct tgtttcggtt atggtaatct 7200 gtggtctggt tgctgtaatc agcaaccaga cacaggtacc acaagtagcc aacagccatc 7260 aaggtgccgc caccacattc acaactcagt tgccaaaaat agatatgaaa cgggttccgg 7320 gagagttgga ttcgggtgct gatgtccaat ggcaacgctc cgcttatcat tttcaacctg 7380 acaaaaacta cattagtgat cctgatggcc caatgtatca catgggatgg taccatctat 7440 tttatcagta caacccagaa tctgccatat ggggcaacat cacatggggt cactccgtat 7500 ccaaagacat gatcaactgg ttccatctcc ctttcgccat ggttccggac cattggtacg 7560 acatcgaagg cgtcatgaca ggttccgcca cagtcctccc aaacggtgag atcatcatgc 7620 tttacacggg caatgcgtac gatctctccc aagtacaatg cttagcgtac gcagtcaact 7680 catcagatcc acttcttata gagtggaaaa aatacgaagg caacccggtt ttattgccgc 7740 cgccaggggt gggttacaag gattttcggg acccatctac attgtggctg ggccccgatg 7800 gtgaatatag aatggtaatg gggtccaagc acaacgagac tattggttgt gctttgattt 7860 accataccac taattttacg cattttgaat tgaatgagga ggtgcttcat gcggtcccac 7920 atactggtat gtgggaatgc gttgatcttt atccggtatc caccacacac acaaacgggt 7980 tggacatggt ggataatggg ccaaatgtaa aatacgtgtt gaaacaaagt ggggatgaag 8040 atcgccatga ttggtatgcg attggaagtt atgattgggt gaatgataag tggtacccgg 8100 atgacccgga aaacgatgtg ggtatcgggt taagatacga ttacggaaag ttttatgcgt 8160 ccaagacgtt ttatgaccaa cataagaaaa ggagggtcct ttggggctat gttggagaaa 8220 ccgatcccga aaagtatgac cttacaaagg gatgggctaa catattgaat attccaagga 8280 ccgtcgtttt ggacacgaaa actaaaacca atttgattca atggccaatt gaggaaaccg 8340 aaaaacttag gtcgaaaaag tatgataaat ttgtagatgt ggagcttcga cccgggtcac 8400 tcattcccct cgagataggt acagccacac agttggatat agttgcgaca ttcgaagttg 8460 atcaaatgat gttggaatca acgctagaag ccgatgttct attcaactgc acgactagtg 8520 ttggctcagt tggaaggggc gtgttgggac cgtttggtgt ggtggttcta gctgatgccc 8580 agcgcaccga acaacttcct gtgtatttct atattgcaaa agataccgac gggacgtcaa 8640 gaacctactt ttgtgctgat gaaacaagat catccaagga tgtagacgtg gggaaatggg 8700 tgtatggaag cagtgttcct gtcctcccta acgaaaagta caatatgagg ttactggtgg 8760 atcattcgat agtggaggga tttgcacaaa acggaagaac ggtggtgaca tcgagagtgt 8820 atccaacgaa ggcaatttac aacgctgcga aggtgttttt gttcaacaac gcgaccggga 8880 ttagggtgaa ggcgtcggtc aagatttgga agatggcgga agcagaactc aaccctttcc 8940 cagttactgg gtggacttct tgagc 8965 19 13092 DNA Expression vector pRM01 misc_feature (10061)..(10061) n = A, C, G, or T 19 tcgacggcgc gcccgatcat ccggatatag ttcctccttt cagcaaaaaa cccctcaaga 60 cccgtttaga ggccccaagg ggttatgcta gttattgctc agcggtggca gcagccaact 120 cagcttcctt tcgggctttg ttagcagccg gatcgatcca agctgtacct cactattcct 180 ttgccctcgg acgagtgctg gggcgtcggt ttccactatc ggcgagtact tctacacagc 240 catcggtcca gacggccgcg cttctgcggg cgatttgtgt acgcccgaca gtcccggctc 300 cggatcggac gattgcgtcg catcgaccct gcgcccaagc tgcatcatcg aaattgccgt 360 caaccaagct ctgatagagt tggtcaagac caatgcggag catatacgcc cggagccgcg 420 gcgatcctgc aagctccgga tgcctccgct cgaagtagcg cgtctgctgc tccatacaag 480 ccaaccacgg cctccagaag aagatgttgg cgacctcgta ttgggaatcc ccgaacatcg 540 cctcgctcca gtcaatgacc gctgttatgc ggccattgtc cgtcaggaca ttgttggagc 600 cgaaatccgc gtgcacgagg tgccggactt cggggcagtc ctcggcccaa agcatcagct 660 catcgagagc ctgcgcgacg gacgcactga cggtgtcgtc catcacagtt tgccagtgat 720 acacatgggg atcagcaatc gcgcatatga aatcacgcca tgtagtgtat tgaccgattc 780 cttgcggtcc gaatgggccg aacccgctcg tctggctaag atcggccgca gcgatcgcat 840 ccatagcctc cgcgaccggc tgcagaacag cgggcagttc ggtttcaggc aggtcttgca 900 acgtgacacc ctgtgcacgg cgggagatgc aataggtcag gctctcgctg aattccccaa 960 tgtcaagcac ttccggaatc gggagcgcgg ccgatgcaaa gtgccgataa acataacgat 1020 ctttgtagaa accatcggcg cagctattta cccgcaggac atatccacgc cctcctacat 1080 cgaagctgaa agcacgagat tcttcgccct ccgagagctg catcaggtcg gagacgctgt 1140 cgaacttttc gatcagaaac ttctcgacag acgtcgcggt gagttcaggc ttttccatgg 1200 gtatatctcc ttcttaaagt taaacaaaat tatttctaga gggaaaccgt tgtggtctcc 1260 ctatagtgag tcgtattaat ttcgcgggat cgagatcgat ccaattccaa tcccacaaaa 1320 atctgagctt aacagcacag ttgctcctct cagagcagaa tcgggtattc aacaccctca 1380 tatcaactac tacgttgtgt ataacggtcc acatgccggt atatacgatg actggggttg 1440 tacaaaggcg gcaacaaacg gcgttcccgg agttgcacac aagaaatttg ccactattac 1500 agaggcaaga gcagcagctg acgcgtacac aacaagtcag caaacagaca ggttgaactt 1560 catccccaaa ggagaagctc aactcaagcc caagagcttt gctaaggccc taacaagccc 1620 accaaagcaa aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc 1680 cccaaaagag atctcctttg ccccggagat tacaatggac gatttcctct atctttacga 1740 tctaggaagg aagttcgaag gtgaaggtga cgacactatg ttcaccactg ataatgagaa 1800 ggttagcctc ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc 1860 agcaggtctc atcaagacga tctacccgag taacaatctc caggagatca aataccttcc 1920 caagaaggtt aaagatgcag tcaaaagatt caggactaat tgcatcaaga acacagagaa 1980 agacatattt ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca 2040 taaaccaagg caagtaatag agattggagt ctctaaaaag gtagttccta ctgaatctaa 2100 ggccatgcat ggagtctaag attcaaatcg aggatctaac agaactcgcc gtgaagactg 2160 gcgaacagtt catacagagt cttttacgac tcaatgacaa gaagaaaatc ttcgtcaaca 2220 tggtggagca cgacactctg gtctactcca aaaatgtcaa agatacagtc tcagaagacc 2280 aaagggctat tgagactttt caacaaagga taatttcggg aaacctcctc ggattccatt 2340 gcccagctat ctgtcacttc atcgaaagga cagtagaaaa ggaaggtggc tcctacaaat 2400 gccatcattg cgataaagga aaggctatca ttcaagatgc ctctgccgac agtggtccca 2460 aagatggacc cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt 2520 caaagcaagt ggattgatgt gacatctcca ctgacgtaag ggatgacgca caatcccact 2580 atccttcgca agacccttcc tctatataag gaagttcatt tcatttggag aggacacgct 2640 cgagctcatt tctctattac ttcagccata acaaaagaac tcttttctct tcttattaaa 2700 ccatgaaaaa gcctgaactc accgcgacgt ctgtcgagaa gtttctgatc gaaaagttcg 2760 acagcgtctc cgacctgatg cagctctcgg agggcgaaga atctcgtgct ttcagcttcg 2820 atgtaggagg gcgtggatat gtcctgcggg taaatagctg cgccgatggt ttctacaaag 2880 atcgttatgt ttatcggcac tttgcatcgg ccgcgctccc gattccggaa gtgcttgaca 2940 ttggggaatt cagcgagagc ctgacctatt gcatctcccg ccgtgcacag ggtgtcacgt 3000 tgcaagacct gcctgaaacc gaactgcccg ctgttctgca gccggtcgcg gaggccatgg 3060 atgcgatcgc tgcggccgat cttagccaga cgagcgggtt cggcccattc ggaccgcaag 3120 gaatcggtca atacactaca tggcgtgatt tcatatgcgc gattgctgat ccccatgtgt 3180 atcactggca aactgtgatg gacgacaccg tcagtgcgtc cgtcgcgcag gctctcgatg 3240 agctgatgct ttgggccgag gactgccccg aagtccggca cctcgtgcac gcggatttcg 3300 gctccaacaa tgtcctgacg gacaatggcc gcataacagc ggtcattgac tggagcgagg 3360 cgatgttcgg ggattcccaa tacgaggtcg ccaacatctt cttctggagg ccgtggttgg 3420 cttgtatgga gcagcagacg cgctacttcg agcggaggca tccggagctt gcaggatcgc 3480 cgcggctccg ggcgtatatg ctccgcattg gtcttgacca actctatcag agcttggttg 3540 acggcaattt cgatgatgca gcttgggcgc agggtcgatg cgacgcaatc gtccgatccg 3600 gagccgggac tgtcgggcgt acacaaatcg cccgcagaag cgcggccgtc tggaccgatg 3660 gctgtgtaga agtactcgcc gatagtggaa accgacgccc cagcactcgt ccgagggcaa 3720 aggaatagtg aggtacctaa agaaggagtg cgtcgaagca gatcgttcaa acatttggca 3780 ataaagtttc ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct 3840 gttgaattac gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg 3900 ggtttttatg attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata 3960 gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgatgtcg 4020 aatctgatca acctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt 4080 gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga 4140 gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca 4200 ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 4260 ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 4320 cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc 4380 ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct 4440 tcgggaagcg tggcgctttc tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc 4500 gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 4560 tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 4620 gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 4680 tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag 4740 ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt 4800 agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa 4860 gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 4920 attttggtca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg 4980 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 5040 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 5100 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 5160 atggacatat tgtcgttaga acgcggctac aattaataca taaccttatg tatcatacac 5220 atacgattta ggtgacacta tagaacggcg cgccaagctt ttgatccatg cccttcattt 5280 gccgcttatt aattaatttg gtaacagtcc gtactaatca gttacttatc cttcccccat 5340 cataattaat cttggtagtc tcgaatgcca caacactgac tagtctcttg gatcataaga 5400 aaaagccaag gaacaaaaga agacaaaaca caatgagagt atcctttgca tagcaatgtc 5460 taagttcata aaattcaaac aaaaacgcaa tcacacacag tggacatcac ttatccacta 5520 gctgatcagg atcgccgcgt caagaaaaaa aaactggacc ccaaaagcca tgcacaacaa 5580 cacgtactca caaaggtgtc aatcgagcag cccaaaacat tcaccaactc aacccatcat 5640 gagccctcac atttgttgtt tctaacccaa cctcaaactc gtattctctt ccgccacctc 5700 atttttgttt atttcaacac ccgtcaaact gcatgccacc ccgtggccaa atgtccatgc 5760 atgttaacaa gacctatgac tataaatagc tgcaatctcg gcccaggttt tcatcatcaa 5820 gaaccagttc aatatcctag tacaccgtat taaagaattt aagatatact gcggccgcac 5880 catggcaacc cctgaacaac ccattacaga ccttgaacac gaacccaacc acaaccgcac 5940 acccctattg gaccacaacg aatcacaacc cgtaaagaaa catttgttct tcaaagttct 6000 gtctggtgtt accttcattt cattgttctt tatttctgct tttttattca ttgttttgaa 6060 ccaacaaaat tctaccaata tatcggttaa gtactcgcaa tccgatcgcc ttacgtggga 6120 acgaaccgct tttcattttc aaccggccaa gaattttatt tatgatccca atggtcaaat 6180 gtactacatg ggctggtacc atctattcta tcaatacaat ccatacgcac cggtttgggg 6240 taatatgtca tggggtcact ccgtatccaa agacatgatc aactggtacg agctacccgt 6300 cgctatagtc ccgactgaat ggtatgatat tgagggcgtc ttatctgggt ccatcacagt 6360 gcttcccaac gggcagatct ttgcattgta cacggggaat gctaatgact tttcccaatt 6420 gcaatgcaaa gctgtacccg tgaactcatc tgacccactt cttgttgagt gggtcaagta 6480 cgaagataac ccaatcctgt acactccacc agggattggg ttaaaagact atagggaccc 6540 gtcaacagtc tggacgggtc ctgatggaaa gcataggatg atcatgggaa ctaaacgtgg 6600 caatacagga atgatacttg tttaccatac cactgattac acgaactatg agatgttgaa 6660 tgagcctatg cactcggttc ccaataccga tatgtgggaa tgcgttgact tttacccggt 6720 ttcattaacc aacgatagtg cacttgatat tgcggcctac gggtcgggta tcaaacacgt 6780 gattaaagaa agttgggagg gatatgggat ggatttctat tcaatcggga cttatgacgc 6840 atttaacgat aaatggactc ccgataaccc agagttagat gttggtatcg ggttgcggtg 6900 tgattacggt aggttttttg catcaaagag tatttttgac ccagtgaaga aaaggaggat 6960 cacttgggct tatgttggag aatcagataa tgctgatgat gacctctcca gaggatgggc 7020 tactatttat aatgttggaa gaactattgt actagataga aagaccggga cccatttact 7080 tcattggcct gtcgaggaaa tcgagagttt gagatacaat ggtcaggaat ttaaagagat 7140 caaactagag cccggttcaa ttgctccact cgacataggc accgctacac agttggacat 7200 agttgcaaca tttaaggtgg atgaggctgc attgaacgcg acaagtgaaa ccgatgataa 7260 cttcgcttgc accacgagct caggtgcagt tgaaagggga agtttgggac catttggtct 7320 tgcggttcta gctgatggaa ccctttccga gttaactccg gtttatttct acattgctaa 7380 aaaggccgat ggaggtgtgt caacacattt ttgtaccgat aagctaaggt catccttgga 7440 ttttgataag gagagagtgg tgtacggtag cactgttcct gtgttagatg atgaagaact 7500 cacaatgagg ctattggtgg atcattcggt agtcgaggcg tttgcacaag gaggaaggat 7560 tgccataaca tcaagggtgt atccgacgaa agcaatatac gaaggagcga agttgttctt 7620 attcaacaat gccacggata cgagtgtgaa ggcatctctc aagatttggc aaatggcttc 7680 tgcccaaatt catcaatacg agtttaatta ggcggccgca agtatgaact aaaatgcatg 7740 taggtgtaag agctcatgga gagcatggaa tattgtatcc gaccatgtaa cagtataata 7800 actgagctcc atctcacttc ttctatgaat aaacaaagga tgttatgata tattaacact 7860 ctatctatgc accttattgt tctatgataa atttcctctt attattataa atcatctgaa 7920 tcgtgacggc ttatggaatg cttcaaatag tacaaaaaca aatgtgtact ataagacttt 7980 ctaaacaatt ctaaccttag cattgtgaac gagacataag tgttaagaag acataacaat 8040 tataatggaa gaagtttgtc tccatttata tattatatat tacccactta tgtattatat 8100 taggatgtta aggagacata acaattataa agagagaagt ttgtatccat ttatatatta 8160 tatactaccc atttatatat tatacttatc cacttattta atgtctttat aaggtttgat 8220 ccatgatatt tctaatattt tagttgatat gtatatgaaa gggtactatt tgaactctct 8280 tactctgtat aaaggttgga tcatccttaa agtgggtcta tttaatttta ttgcttctta 8340 cagataaaaa aaaaattatg agttggtttg ataaaatatt gaaggattta aaataataat 8400 aaataacata taatatatgt atataaattt attataatat aacatttatc tataaaaaag 8460 taaatattgt cataaatcta tacaatcgtt tagccttgct ggacgaatct caattattta 8520 aacgagagta aacatatttg actttttggt tatttaacaa attattattt aacactatat 8580 gaaatttttt tttttatcag caaagaataa aattaaatta agaaggacaa tggtgtccca 8640 atccttatac aaccaacttc cacaagaaag tcaagtcaga gacaacaaaa aaacaagcaa 8700 aggaaatttt ttaatttgag ttgtcttgtt tgctgcataa tttatgcagt aaaacactac 8760 acataaccct tttagcagta gagcaatggt tgaccgtgtg cttagcttct tttattttat 8820 ttttttatca gcaaagaata aataaaataa aatgagacac ttcagggatg tttcaacaag 8880 cttggatcct cgaagagaag ggttaataac acatttttta acatttttaa cacaaatttt 8940 agttatttaa aaatttatta aaaaatttaa aataagaaga ggaactcttt aaataaatct 9000 aacttacaaa atttatgatt tttaataagt tttcaccaat aaaaaatgtc ataaaaatat 9060 gttaaaaagt atattatcaa tattctcttt atgataaata aaaagaaaaa aaaaataaaa 9120 gttaagtgaa aatgagattg aagtgacttt aggtgtgtat aaatatatca accccgccaa 9180 caatttattt aatccaaata tattgaagta tattattcca tagcctttat ttatttatat 9240 atttattata taaaagcttt atttgttcta ggttgttcat gaaatatttt tttggtttta 9300 tctccgttgt aagaaaatca tgtgctttgt gtcgccactc actattgcag ctttttcatg 9360 cattggtcag attgacggtt gattgtattt ttgtttttta tggttttgtg ttatgactta 9420 agtcttcatc tctttatctc ttcatcaggt ttgatggtta cctaatatgg tccatgggta 9480 catgcatggt taaattaggt ggccaacttt gttgtgaacg atagaatttt ttttatatta 9540 agtaaactat ttttatatta tgaaataata ataaaaaaaa tattttatca ttattaacaa 9600 aatcatatta gttaatttgt taactctata ataaaagaaa tactgtaaca ttcacattac 9660 atggtaacat ctttccaccc tttcatttgt tttttgtttg atgacttttt ttcttgttta 9720 aatttatttc ccttctttta aatttggaat acattatcat catatataaa ctaaaatact 9780 aaaaacagga ttacacaaat gataaataat aacacaaata tttataaatc tagctgcaat 9840 atatttaaac tagctatatc gatattgtaa aataaaacta gctgcattga tactgataaa 9900 aaaatatcat gtgctttctg gactgatgat gcagtatact tttgacattg cctttatttt 9960 atttttcaga aaagctttct tagttctggg ttcttcatta tttgtttccc atctccattg 10020 tgaattgaat catttgcttc gtgtcacaaa tacaatttag ntaggtacat gcattggtca 10080 gattcacggt ttattatgtc atgacttaag ttcatggtag tacattacct gccacgcatg 10140 cattatattg gttagatttg ataggcaaat ttggttgtca acaatataaa tataaataat 10200 gtttttatat tacgaaataa cagtgatcaa aacaaacagt tttatcttta ttaacaagat 10260 tttgtttttg tttgatgacg ttttttaatg tttacgcttt cccccttctt ttgaatttag 10320 aacactttat catcataaaa tcaaatacta aaaaaattac atatttcata aataataaca 10380 caaatatttt taaaaaatct gaaataataa tgaacaatat tacatattat cacgaaaatt 10440 cattaataaa aatattatat aaataaaatg taatagtagt tatatgtagg aaaaaagtac 10500 tgcacgcata atatatacaa aaagattaaa atgaactatt ataaataata acactaaatt 10560 aatggtgaat catatcaaaa taatgaaaaa gtaaataaaa tttgtaatta acttctatat 10620 gtattacaca cacaaataat aaataatagt aaaaaaaatt atgataaata tttaccatct 10680 cataagatat ttaaaataat gataaaaata tagattattt tttatgcaac tagctagcca 10740 aaaagagaac acgggtatat ataaaaagag tacctttaaa ttctactgta cttcctttat 10800 tcctgacgtt tttatatcaa gtggacatac gtgaagattt taattatcag tctaaatatt 10860 tcattagcac ttaatacttt tctgttttat tcctatccta taagtagtcc cgattctccc 10920 aacattgctt attcacacaa ctaactaaga aagtcttcca tagcccccca agcggccgcg 10980 ccatggcttc atctaccacc acctcccctc tcattctcca cgatgatcct gaaaacctcc 11040 aggaacccac cggatttacg ggggttcgtc gtccatccat cgcaaaagcg ctttgcgtaa 11100 cccttgtttc ggttatggta atctgtggtc tggttgctgt aatcagcaac cagacacagg 11160 taccacaagt agccaacagc catcaaggtg ccgccaccac attcacaact cagttgccaa 11220 aaatagatat gaaacgggtt ccgggagagt tggattcggg tgctgatgtc caatggcaac 11280 gctccgctta tcattttcaa cctgacaaaa actacattag tgatcctgat ggcccaatgt 11340 atcacatggg atggtaccat ctattttatc agtacaaccc agaatctgcc atatggggca 11400 acatcacatg gggtcactcc gtatccaaag acatgatcaa ctggttccat ctccctttcg 11460 ccatggttcc ggaccattgg tacgacatcg aaggcgtcat gacaggttcc gccacagtcc 11520 tcccaaacgg tgagatcatc atgctttaca cgggcaatgc gtacgatctc tcccaagtac 11580 aatgcttagc gtacgcagtc aactcatcag atccacttct tatagagtgg aaaaaatacg 11640 aaggcaaccc ggttttattg ccgccgccag gggtgggtta caaggatttt cgggacccat 11700 ctacattgtg gctgggcccc gatggtgaat atagaatggt aatggggtcc aagcacaacg 11760 agactattgg ttgtgctttg atttaccata ccactaattt tacgcatttt gaattgaatg 11820 aggaggtgct tcatgcggtc ccacatactg gtatgtggga atgcgttgat ctttatccgg 11880 tatccaccac acacacaaac gggttggaca tggtggataa tgggccaaat gtaaaatacg 11940 tgttgaaaca aagtggggat gaagatcgcc atgattggta tgcgattgga agttatgatt 12000 gggtgaatga taagtggtac ccggatgacc cggaaaacga tgtgggtatc gggttaagat 12060 acgattacgg aaagttttat gcgtccaaga cgttttatga ccaacataag aaaaggaggg 12120 tcctttgggg ctatgttgga gaaaccgatc ccgaaaagta tgaccttaca aagggatggg 12180 ctaacatatt gaatattcca aggaccgtcg ttttggacac gaaaactaaa accaatttga 12240 ttcaatggcc aattgaggaa accgaaaaac ttaggtcgaa aaagtatgat aaatttgtag 12300 atgtggagct tcgacccggg tcactcattc ccctcgagat aggtacagcc acacagttgg 12360 atatagttgc gacattcgaa gttgatcaaa tgatgttgga atcaacgcta gaagccgatg 12420 ttctattcaa ctgcacgact agtgttggct cagttggaag gggcgtgttg ggaccgtttg 12480 gtgtggtggt tctagctgat gcccagcgca ccgaacaact tcctgtgtat ttctatattg 12540 caaaagatac cgacgggacg tcaagaacct acttttgtgc tgatgaaaca agatcatcca 12600 aggatgtaga cgtggggaaa tgggtgtatg gaagcagtgt tcctgtcctc cctaacgaaa 12660 agtacaatat gaggttactg gtggatcatt cgatagtgga gggatttgca caaaacggaa 12720 gaacggtggt gacatcgaga gtgtatccaa cgaaggcaat ttacaacgct gcgaaggtgt 12780 ttttgttcaa caacgcgacc gggattaggg tgaaggcgtc ggtcaagatt tggaagatgg 12840 cggaagcaga actcaaccct ttcccagtta ctgggtggac ttcttgagcg gccgcgacac 12900 aagtgtgaga gtactaaata aatgctttgg ttgtacgaaa tcattacact aaataaaata 12960 atcaaagctt atatatgcct tccgctaagg ccgaatgcaa agaaattggt tctttctcgt 13020 tatcttttgc cacttttact agtacgtatt aattactact taatcatctt tgtttacggc 13080 tcattatatc cg 13092 20 13092 DNA Expression vector pRM04 misc_feature (10136)..(10136) n = A, C, G, or T 20 tcgacggcgc gcccgatcat ccggatatag ttcctccttt cagcaaaaaa cccctcaaga 60 cccgtttaga ggccccaagg ggttatgcta gttattgctc agcggtggca gcagccaact 120 cagcttcctt tcgggctttg ttagcagccg gatcgatcca agctgtacct cactattcct 180 ttgccctcgg acgagtgctg gggcgtcggt ttccactatc ggcgagtact tctacacagc 240 catcggtcca gacggccgcg cttctgcggg cgatttgtgt acgcccgaca gtcccggctc 300 cggatcggac gattgcgtcg catcgaccct gcgcccaagc tgcatcatcg aaattgccgt 360 caaccaagct ctgatagagt tggtcaagac caatgcggag catatacgcc cggagccgcg 420 gcgatcctgc aagctccgga tgcctccgct cgaagtagcg cgtctgctgc tccatacaag 480 ccaaccacgg cctccagaag aagatgttgg cgacctcgta ttgggaatcc ccgaacatcg 540 cctcgctcca gtcaatgacc gctgttatgc ggccattgtc cgtcaggaca ttgttggagc 600 cgaaatccgc gtgcacgagg tgccggactt cggggcagtc ctcggcccaa agcatcagct 660 catcgagagc ctgcgcgacg gacgcactga cggtgtcgtc catcacagtt tgccagtgat 720 acacatgggg atcagcaatc gcgcatatga aatcacgcca tgtagtgtat tgaccgattc 780 cttgcggtcc gaatgggccg aacccgctcg tctggctaag atcggccgca gcgatcgcat 840 ccatagcctc cgcgaccggc tgcagaacag cgggcagttc ggtttcaggc aggtcttgca 900 acgtgacacc ctgtgcacgg cgggagatgc aataggtcag gctctcgctg aattccccaa 960 tgtcaagcac ttccggaatc gggagcgcgg ccgatgcaaa gtgccgataa acataacgat 1020 ctttgtagaa accatcggcg cagctattta cccgcaggac atatccacgc cctcctacat 1080 cgaagctgaa agcacgagat tcttcgccct ccgagagctg catcaggtcg gagacgctgt 1140 cgaacttttc gatcagaaac ttctcgacag acgtcgcggt gagttcaggc ttttccatgg 1200 gtatatctcc ttcttaaagt taaacaaaat tatttctaga gggaaaccgt tgtggtctcc 1260 ctatagtgag tcgtattaat ttcgcgggat cgagatcgat ccaattccaa tcccacaaaa 1320 atctgagctt aacagcacag ttgctcctct cagagcagaa tcgggtattc aacaccctca 1380 tatcaactac tacgttgtgt ataacggtcc acatgccggt atatacgatg actggggttg 1440 tacaaaggcg gcaacaaacg gcgttcccgg agttgcacac aagaaatttg ccactattac 1500 agaggcaaga gcagcagctg acgcgtacac aacaagtcag caaacagaca ggttgaactt 1560 catccccaaa ggagaagctc aactcaagcc caagagcttt gctaaggccc taacaagccc 1620 accaaagcaa aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc 1680 cccaaaagag atctcctttg ccccggagat tacaatggac gatttcctct atctttacga 1740 tctaggaagg aagttcgaag gtgaaggtga cgacactatg ttcaccactg ataatgagaa 1800 ggttagcctc ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc 1860 agcaggtctc atcaagacga tctacccgag taacaatctc caggagatca aataccttcc 1920 caagaaggtt aaagatgcag tcaaaagatt caggactaat tgcatcaaga acacagagaa 1980 agacatattt ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca 2040 taaaccaagg caagtaatag agattggagt ctctaaaaag gtagttccta ctgaatctaa 2100 ggccatgcat ggagtctaag attcaaatcg aggatctaac agaactcgcc gtgaagactg 2160 gcgaacagtt catacagagt cttttacgac tcaatgacaa gaagaaaatc ttcgtcaaca 2220 tggtggagca cgacactctg gtctactcca aaaatgtcaa agatacagtc tcagaagacc 2280 aaagggctat tgagactttt caacaaagga taatttcggg aaacctcctc ggattccatt 2340 gcccagctat ctgtcacttc atcgaaagga cagtagaaaa ggaaggtggc tcctacaaat 2400 gccatcattg cgataaagga aaggctatca ttcaagatgc ctctgccgac agtggtccca 2460 aagatggacc cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt 2520 caaagcaagt ggattgatgt gacatctcca ctgacgtaag ggatgacgca caatcccact 2580 atccttcgca agacccttcc tctatataag gaagttcatt tcatttggag aggacacgct 2640 cgagctcatt tctctattac ttcagccata acaaaagaac tcttttctct tcttattaaa 2700 ccatgaaaaa gcctgaactc accgcgacgt ctgtcgagaa gtttctgatc gaaaagttcg 2760 acagcgtctc cgacctgatg cagctctcgg agggcgaaga atctcgtgct ttcagcttcg 2820 atgtaggagg gcgtggatat gtcctgcggg taaatagctg cgccgatggt ttctacaaag 2880 atcgttatgt ttatcggcac tttgcatcgg ccgcgctccc gattccggaa gtgcttgaca 2940 ttggggaatt cagcgagagc ctgacctatt gcatctcccg ccgtgcacag ggtgtcacgt 3000 tgcaagacct gcctgaaacc gaactgcccg ctgttctgca gccggtcgcg gaggccatgg 3060 atgcgatcgc tgcggccgat cttagccaga cgagcgggtt cggcccattc ggaccgcaag 3120 gaatcggtca atacactaca tggcgtgatt tcatatgcgc gattgctgat ccccatgtgt 3180 atcactggca aactgtgatg gacgacaccg tcagtgcgtc cgtcgcgcag gctctcgatg 3240 agctgatgct ttgggccgag gactgccccg aagtccggca cctcgtgcac gcggatttcg 3300 gctccaacaa tgtcctgacg gacaatggcc gcataacagc ggtcattgac tggagcgagg 3360 cgatgttcgg ggattcccaa tacgaggtcg ccaacatctt cttctggagg ccgtggttgg 3420 cttgtatgga gcagcagacg cgctacttcg agcggaggca tccggagctt gcaggatcgc 3480 cgcggctccg ggcgtatatg ctccgcattg gtcttgacca actctatcag agcttggttg 3540 acggcaattt cgatgatgca gcttgggcgc agggtcgatg cgacgcaatc gtccgatccg 3600 gagccgggac tgtcgggcgt acacaaatcg cccgcagaag cgcggccgtc tggaccgatg 3660 gctgtgtaga agtactcgcc gatagtggaa accgacgccc cagcactcgt ccgagggcaa 3720 aggaatagtg aggtacctaa agaaggagtg cgtcgaagca gatcgttcaa acatttggca 3780 ataaagtttc ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct 3840 gttgaattac gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg 3900 ggtttttatg attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata 3960 gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgatgtcg 4020 aatctgatca acctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt 4080 gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga 4140 gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca 4200 ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 4260 ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 4320 cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc 4380 ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct 4440 tcgggaagcg tggcgctttc tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc 4500 gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 4560 tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 4620 gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 4680 tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag 4740 ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt 4800 agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa 4860 gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 4920 attttggtca tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg 4980 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 5040 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 5100 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 5160 atggacatat tgtcgttaga acgcggctac aattaataca taaccttatg tatcatacac 5220 atacgattta ggtgacacta tagaacggcg cgccaagctt ttgatccatg cccttcattt 5280 gccgcttatt aattaatttg gtaacagtcc gtactaatca gttacttatc cttcccccat 5340 cataattaat cttggtagtc tcgaatgcca caacactgac tagtctcttg gatcataaga 5400 aaaagccaag gaacaaaaga agacaaaaca caatgagagt atcctttgca tagcaatgtc 5460 taagttcata aaattcaaac aaaaacgcaa tcacacacag tggacatcac ttatccacta 5520 gctgatcagg atcgccgcgt caagaaaaaa aaactggacc ccaaaagcca tgcacaacaa 5580 cacgtactca caaaggtgtc aatcgagcag cccaaaacat tcaccaactc aacccatcat 5640 gagccctcac atttgttgtt tctaacccaa cctcaaactc gtattctctt ccgccacctc 5700 atttttgttt atttcaacac ccgtcaaact gcatgccacc ccgtggccaa atgtccatgc 5760 atgttaacaa gacctatgac tataaatagc tgcaatctcg gcccaggttt tcatcatcaa 5820 gaaccagttc aatatcctag tacaccgtat taaagaattt aagatatact gcggccgcgc 5880 catggcttca tctaccacca cctcccctct cattctccac gatgatcctg aaaacctcca 5940 ggaacccacc ggatttacgg gggttcgtcg tccatccatc gcaaaagcgc tttgcgtaac 6000 ccttgtttcg gttatggtaa tctgtggtct ggttgctgta atcagcaacc agacacaggt 6060 accacaagta gccaacagcc atcaaggtgc cgccaccaca ttcacaactc agttgccaaa 6120 aatagatatg aaacgggttc cgggagagtt ggattcgggt gctgatgtcc aatggcaacg 6180 ctccgcttat cattttcaac ctgacaaaaa ctacattagt gatcctgatg gcccaatgta 6240 tcacatggga tggtaccatc tattttatca gtacaaccca gaatctgcca tatggggcaa 6300 catcacatgg ggtcactccg tatccaaaga catgatcaac tggttccatc tccctttcgc 6360 catggttccg gaccattggt acgacatcga aggcgtcatg acaggttccg ccacagtcct 6420 cccaaacggt gagatcatca tgctttacac gggcaatgcg tacgatctct cccaagtaca 6480 atgcttagcg tacgcagtca actcatcaga tccacttctt atagagtgga aaaaatacga 6540 aggcaacccg gttttattgc cgccgccagg ggtgggttac aaggattttc gggacccatc 6600 tacattgtgg ctgggccccg atggtgaata tagaatggta atggggtcca agcacaacga 6660 gactattggt tgtgctttga tttaccatac cactaatttt acgcattttg aattgaatga 6720 ggaggtgctt catgcggtcc cacatactgg tatgtgggaa tgcgttgatc tttatccggt 6780 atccaccaca cacacaaacg ggttggacat ggtggataat gggccaaatg taaaatacgt 6840 gttgaaacaa agtggggatg aagatcgcca tgattggtat gcgattggaa gttatgattg 6900 ggtgaatgat aagtggtacc cggatgaccc ggaaaacgat gtgggtatcg ggttaagata 6960 cgattacgga aagttttatg cgtccaagac gttttatgac caacataaga aaaggagggt 7020 cctttggggc tatgttggag aaaccgatcc cgaaaagtat gaccttacaa agggatgggc 7080 taacatattg aatattccaa ggaccgtcgt tttggacacg aaaactaaaa ccaatttgat 7140 tcaatggcca attgaggaaa ccgaaaaact taggtcgaaa aagtatgata aatttgtaga 7200 tgtggagctt cgacccgggt cactcattcc cctcgagata ggtacagcca cacagttgga 7260 tatagttgcg acattcgaag ttgatcaaat gatgttggaa tcaacgctag aagccgatgt 7320 tctattcaac tgcacgacta gtgttggctc agttggaagg ggcgtgttgg gaccgtttgg 7380 tgtggtggtt ctagctgatg cccagcgcac cgaacaactt cctgtgtatt tctatattgc 7440 aaaagatacc gacgggacgt caagaaccta cttttgtgct gatgaaacaa gatcatccaa 7500 ggatgtagac gtggggaaat gggtgtatgg aagcagtgtt cctgtcctcc ctaacgaaaa 7560 gtacaatatg aggttactgg tggatcattc gatagtggag ggatttgcac aaaacggaag 7620 aacggtggtg acatcgagag tgtatccaac gaaggcaatt tacaacgctg cgaaggtgtt 7680 tttgttcaac aacgcgaccg ggattagggt gaaggcgtcg gtcaagattt ggaagatggc 7740 ggaagcagaa ctcaaccctt tcccagttac tgggtggact tcttgagcgg ccgcaagtat 7800 gaactaaaat gcatgtaggt gtaagagctc atggagagca tggaatattg tatccgacca 7860 tgtaacagta taataactga gctccatctc acttcttcta tgaataaaca aaggatgtta 7920 tgatatatta acactctatc tatgcacctt attgttctat gataaatttc ctcttattat 7980 tataaatcat ctgaatcgtg acggcttatg gaatgcttca aatagtacaa aaacaaatgt 8040 gtactataag actttctaaa caattctaac cttagcattg tgaacgagac ataagtgtta 8100 agaagacata acaattataa tggaagaagt ttgtctccat ttatatatta tatattaccc 8160 acttatgtat tatattagga tgttaaggag acataacaat tataaagaga gaagtttgta 8220 tccatttata tattatatac tacccattta tatattatac ttatccactt atttaatgtc 8280 tttataaggt ttgatccatg atatttctaa tattttagtt gatatgtata tgaaagggta 8340 ctatttgaac tctcttactc tgtataaagg ttggatcatc cttaaagtgg gtctatttaa 8400 ttttattgct tcttacagat aaaaaaaaaa ttatgagttg gtttgataaa atattgaagg 8460 atttaaaata ataataaata acatataata tatgtatata aatttattat aatataacat 8520 ttatctataa aaaagtaaat attgtcataa atctatacaa tcgtttagcc ttgctggacg 8580 aatctcaatt atttaaacga gagtaaacat atttgacttt ttggttattt aacaaattat 8640 tatttaacac tatatgaaat tttttttttt atcagcaaag aataaaatta aattaagaag 8700 gacaatggtg tcccaatcct tatacaacca acttccacaa gaaagtcaag tcagagacaa 8760 caaaaaaaca agcaaaggaa attttttaat ttgagttgtc ttgtttgctg cataatttat 8820 gcagtaaaac actacacata acccttttag cagtagagca atggttgacc gtgtgcttag 8880 cttcttttat tttatttttt tatcagcaaa gaataaataa aataaaatga gacacttcag 8940 ggatgtttca acaagcttgg atcctcgaag agaagggtta ataacacatt ttttaacatt 9000 tttaacacaa attttagtta tttaaaaatt tattaaaaaa tttaaaataa gaagaggaac 9060 tctttaaata aatctaactt acaaaattta tgatttttaa taagttttca ccaataaaaa 9120 atgtcataaa aatatgttaa aaagtatatt atcaatattc tctttatgat aaataaaaag 9180 aaaaaaaaaa taaaagttaa gtgaaaatga gattgaagtg actttaggtg tgtataaata 9240 tatcaacccc gccaacaatt tatttaatcc aaatatattg aagtatatta ttccatagcc 9300 tttatttatt tatatattta ttatataaaa gctttatttg ttctaggttg ttcatgaaat 9360 atttttttgg ttttatctcc gttgtaagaa aatcatgtgc tttgtgtcgc cactcactat 9420 tgcagctttt tcatgcattg gtcagattga cggttgattg tatttttgtt ttttatggtt 9480 ttgtgttatg acttaagtct tcatctcttt atctcttcat caggtttgat ggttacctaa 9540 tatggtccat gggtacatgc atggttaaat taggtggcca actttgttgt gaacgataga 9600 atttttttta tattaagtaa actattttta tattatgaaa taataataaa aaaaatattt 9660 tatcattatt aacaaaatca tattagttaa tttgttaact ctataataaa agaaatactg 9720 taacattcac attacatggt aacatctttc caccctttca tttgtttttt gtttgatgac 9780 tttttttctt gtttaaattt atttcccttc ttttaaattt ggaatacatt atcatcatat 9840 ataaactaaa atactaaaaa caggattaca caaatgataa ataataacac aaatatttat 9900 aaatctagct gcaatatatt taaactagct atatcgatat tgtaaaataa aactagctgc 9960 attgatactg ataaaaaaat atcatgtgct ttctggactg atgatgcagt atacttttga 10020 cattgccttt attttatttt tcagaaaagc tttcttagtt ctgggttctt cattatttgt 10080 ttcccatctc cattgtgaat tgaatcattt gcttcgtgtc acaaatacaa tttagntagg 10140 tacatgcatt ggtcagattc acggtttatt atgtcatgac ttaagttcat ggtagtacat 10200 tacctgccac gcatgcatta tattggttag atttgatagg caaatttggt tgtcaacaat 10260 ataaatataa ataatgtttt tatattacga aataacagtg atcaaaacaa acagttttat 10320 ctttattaac aagattttgt ttttgtttga tgacgttttt taatgtttac gctttccccc 10380 ttcttttgaa tttagaacac tttatcatca taaaatcaaa tactaaaaaa attacatatt 10440 tcataaataa taacacaaat atttttaaaa aatctgaaat aataatgaac aatattacat 10500 attatcacga aaattcatta ataaaaatat tatataaata aaatgtaata gtagttatat 10560 gtaggaaaaa agtactgcac gcataatata tacaaaaaga ttaaaatgaa ctattataaa 10620 taataacact aaattaatgg tgaatcatat caaaataatg aaaaagtaaa taaaatttgt 10680 aattaacttc tatatgtatt acacacacaa ataataaata atagtaaaaa aaattatgat 10740 aaatatttac catctcataa gatatttaaa ataatgataa aaatatagat tattttttat 10800 gcaactagct agccaaaaag agaacacggg tatatataaa aagagtacct ttaaattcta 10860 ctgtacttcc tttattcctg acgtttttat atcaagtgga catacgtgaa gattttaatt 10920 atcagtctaa atatttcatt agcacttaat acttttctgt tttattccta tcctataagt 10980 agtcccgatt ctcccaacat tgcttattca cacaactaac taagaaagtc ttccatagcc 11040 ccccaagcgg ccgcaccatg gcaacccctg aacaacccat tacagacctt gaacacgaac 11100 ccaaccacaa ccgcacaccc ctattggacc acaacgaatc acaacccgta aagaaacatt 11160 tgttcttcaa agttctgtct ggtgttacct tcatttcatt gttctttatt tctgcttttt 11220 tattcattgt tttgaaccaa caaaattcta ccaatatatc ggttaagtac tcgcaatccg 11280 atcgccttac gtgggaacga accgcttttc attttcaacc ggccaagaat tttatttatg 11340 atcccaatgg tcaaatgtac tacatgggct ggtaccatct attctatcaa tacaatccat 11400 acgcaccggt ttggggtaat atgtcatggg gtcactccgt atccaaagac atgatcaact 11460 ggtacgagct acccgtcgct atagtcccga ctgaatggta tgatattgag ggcgtcttat 11520 ctgggtccat cacagtgctt cccaacgggc agatctttgc attgtacacg gggaatgcta 11580 atgacttttc ccaattgcaa tgcaaagctg tacccgtgaa ctcatctgac ccacttcttg 11640 ttgagtgggt caagtacgaa gataacccaa tcctgtacac tccaccaggg attgggttaa 11700 aagactatag ggacccgtca acagtctgga cgggtcctga tggaaagcat aggatgatca 11760 tgggaactaa acgtggcaat acaggaatga tacttgttta ccataccact gattacacga 11820 actatgagat gttgaatgag cctatgcact cggttcccaa taccgatatg tgggaatgcg 11880 ttgactttta cccggtttca ttaaccaacg atagtgcact tgatattgcg gcctacgggt 11940 cgggtatcaa acacgtgatt aaagaaagtt gggagggata tgggatggat ttctattcaa 12000 tcgggactta tgacgcattt aacgataaat ggactcccga taacccagag ttagatgttg 12060 gtatcgggtt gcggtgtgat tacggtaggt tttttgcatc aaagagtatt tttgacccag 12120 tgaagaaaag gaggatcact tgggcttatg ttggagaatc agataatgct gatgatgacc 12180 tctccagagg atgggctact atttataatg ttggaagaac tattgtacta gatagaaaga 12240 ccgggaccca tttacttcat tggcctgtcg aggaaatcga gagtttgaga tacaatggtc 12300 aggaatttaa agagatcaaa ctagagcccg gttcaattgc tccactcgac ataggcaccg 12360 ctacacagtt ggacatagtt gcaacattta aggtggatga ggctgcattg aacgcgacaa 12420 gtgaaaccga tgataacttc gcttgcacca cgagctcagg tgcagttgaa aggggaagtt 12480 tgggaccatt tggtcttgcg gttctagctg atggaaccct ttccgagtta actccggttt 12540 atttctacat tgctaaaaag gccgatggag gtgtgtcaac acatttttgt accgataagc 12600 taaggtcatc cttggatttt gataaggaga gagtggtgta cggtagcact gttcctgtgt 12660 tagatgatga agaactcaca atgaggctat tggtggatca ttcggtagtc gaggcgtttg 12720 cacaaggagg aaggattgcc ataacatcaa gggtgtatcc gacgaaagca atatacgaag 12780 gagcgaagtt gttcttattc aacaatgcca cggatacgag tgtgaaggca tctctcaaga 12840 tttggcaaat ggcttctgcc caaattcatc aatacgagtt taattaggcg gccgcgacac 12900 aagtgtgaga gtactaaata aatgctttgg ttgtacgaaa tcattacact aaataaaata 12960 atcaaagctt atatatgcct tccgctaagg ccgaatgcaa agaaattggt tctttctcgt 13020 tatcttttgc cacttttact agtacgtatt aattactact taatcatctt tgtttacggc 13080 tcattatatc cg 13092 

What is claimed is:
 1. A plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to at least a portion of at least one coding sequence for a plant fructosyltransferase, operably linked to a vacuole targeting sequence, said molecule sufficient to express a protein capable of producing fructan having a degree of polymerization of at least three, in an embryo of said plant, or any progeny thereof, wherein said progeny comprise said molecule.
 2. The plant of claim 1 wherein said fructan is inulin.
 3. The plant of claim 1 wherein said fructosyltransferase is selected from the group consisting of sucrose:sucrose fructosyltransferase, and the combination of sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase.
 4. The plant of claim 1 wherein said plant is a cereal.
 5. The plant of claim 1 wherein said plant is corn.
 6. The plant of claim 1 wherein said plant is soybean.
 7. The plant of claim 1 wherein said coding sequence for fructosyltransferase is selected from the group consisting of a monocot and a dicot.
 8. The plant of claim 1 wherein said at least one fructosyltransferase comprises a sucrose:sucrose fructosyltransferase.
 9. The plant of claim 1 wherein said at least one fructosyltransferase comprises a first fructosyltransferase and a second fructosyltransferase, wherein said first fructosyltransferase comprises sucrose:sucrose fructosyltransferase and said second fructosyltransferase comprises fructose:fructose fructosyltransferase.
 10. The plant of claim 1 wherein said at least one DNA molecule comprises a first DNA molecule and a second DNA molecule, said first DNA molecule comprises sucrose:sucrose fructosyltransferase and said second DNA molecule comprises fructose:fructose fructosyltransferase.
 11. The plant of claim 1 wherein said at least one DNA molecule comprises sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase.
 12. Fructan produced by the enzyme encoded by the coding sequence in the recombinant DNA molecule from the plant of claim
 1. 13. A recombinant DNA molecule comprising an embryo specific promoter operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, operably linked to a vacuole targeting sequence, said molecule sufficient to express a protein capable of producing fructan in an embryo cell.
 14. The recombinant DNA molecule of claim 13 wherein said fructan is inulin.
 15. The recombinant DNA molecule of claim 13 wherein said fructosyl-transferase is selected from the group consisting of sucrose:sucrose fructosyltransferase, and the combination of sucrose:sucrose fructosyltransferase and fructose:fructose fructosyltransferase.
 16. A method of producing fructan in a plant comprising: a) constructing at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, b) transforming a plant cell with said construct, c) regenerating said plant to produce seed, d) harvesting seed from said plant of step c, and e) extracting fructan from seed of step d.
 17. The method according to claim 16, wherein said fructan is inulin.
 18. A method of screening transgenic maize tissue expressing embryo targeted genes comprising: a) preparing Type-II maize callus for transformation, b) transforming callus, c) selecting transgenic callus lines, d) regenerating transgenic somatic embryos, and e) propagating transgenic somatic embryos for both plant production and early trait analyses.
 19. A foodstuff comprising fructan obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, said molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of said plant, or any progeny thereof, wherein said progeny comprise said molecule.
 20. A foodstuff comprising inulin obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, said molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of said plant, or any progeny thereof, wherein said progeny comprise said molecule.
 21. An industrial product comprising fructan obtained from a plant comprising at least one recombinant DNA molecule comprising an embryo specific promoter operably linked to a vacuole targeting sequence operably linked to at least a portion of at least one coding sequence for a fructosyltransferase, said molecule sufficient to express a protein capable of producing fructan of at least DP3 in a grain of said plant, or any progeny thereof, wherein said progeny comprise said molecule.
 22. The industrial product of claim 21 selected from the group consisting of a hydrocolloid, a bleach activator, a dispersing agent, a glue, and a biodegradable complexing agent.
 23. Grain of the plant of claim
 1. 24. Grain of the plant of claims 5 or
 6. 25. A foodstuff comprising grain of the plant of claim
 1. 26. A foodstuff comprising grain of the plant of claims 5 or
 6. 